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Description 
NOVEL BETA -DEFENS INS 

REFERENCE TO RELATED APPLICATIONS 

| This /application is a continuation-in-part of 

U.S. Patent Application No. 09/344,097, filed on June 25, 
1999, which a continuation in part of U.S. Patent 

Application^No. 09/150, 786, filed on September 10, 1998, 
which is related to Provisional Applications 60/058,335, 
filed on 'September 10, 1997 and 60/054,294, filed on 
November / 5, 1997. Under 35 U.S.C. 5119(e) (1), this 
application claims benefit of said Provisional 
Applications . 

BACKGROUND OF THE INVENTION 

Biological defense strategies have evolved to 
protect organisms from invasion by other species. 
Microbial infection response systems include oxidative and 
non-oxidative mechanisms, utilizing compounds that are 
enzymatically synthesized in cells and peptides that are 
single gene products. 

Anti-microbial peptides constitute an oxygen- 
independent host defense system found in organisms 
encompassing many taxonomic families. One major class of 
anti-microbial peptides can be sequence-defined by 
conserved cysteine residue patterns and are termed 
defensins. Mammalian defensins, derived from skin, lung 
and intestine, exhibit antibiotic activity against a wide 
variety of pathogens, including gram-positive and gram- 
negative bacteria, fungi ( e.g. Candida species) and 
viruses. See, for example, Porter et al . , Infect. Immun. 
65(6) : 2396-401, 1997. 

The amphipathic character of the defensin 
peptides appears to be the key to the general mechanism of 



microbial attack, i.e. , by creating pores, or "boring" 
through the cell wall. In addition, Daher et al . , J. 
Virol . 60 (3) : 1068-74, 1986, reported that enveloped 
viruses, including herpes simplex types 1 and 2, 
5 cytomegalovirus and influenza virus (A/WSN) , among others, 
were inactivate^ by incubation with human neutrophil 
peptide (HNP-1) and speculated that the binding of 
defensin molecules to viruses impairs the virus' ability 
to infect cells. 

J=i 10 The ' defensin family of anti -microbial peptides 

Cl can be divided into two major subclasses based on two 
f7 distinct consensus sequences. See, for example, Martin et 
M al., Journal of Leukocyte Biology 58 : 128-36, 1995. The 
JJ; first defensin subclass, classic defensins, represented by 
, IS HNPs are stored in the so-called large azurophil granules 
j=* of neutrophils and macrophages and attack microorganisms 
Pi that have been phagocytosed by these cells. The amino 
Ui .acid sequence of HNPs is consistent with a predicted 
= 5 disulfide bridging that is distinct from that of the (3- 
20 defensin subclass. Epithelial cells can also be a source 
of defensins, and these cells appear to secrete these 
peptides into the external, extra-cellular environment. 
In the mouse, for example, Paneth cells of the small 
intestine and proximal colon, secrete def ensin-like 
25 peptides, called cryptidins, into the lumen. See, for 
example, Ouellette and Selsted, The FASEB Journal 10 : 
1280-9, 1996. 

fi-def ensins , the second major defensin subclass, 
include peptides found in bovine lung ( e.g. , BNBD-bovine 
3 0 neutrophil |3-def ensins) as well as a secreted form (TAP - 
tracheal anti-microbial peptide) . See, for example, 
Selsted et al . , J. Biol. Chem. 268 (9) : 6641-8, 1993. Two 
human P-defensins have been reported. SAP-1 was isolated 
from human psoriatic skin, and hBD-1 was found in low 
35 concentrations in human blood filtrate. See, for example, 
Bensch et al., FEBS Lett. 368 (2) : 331-5, 1995). The amino 
acid sequence of these human P-defensins is most similar 



to the bovine BNDPs and TAP. See, for example, Harder et 
al., Nature 387 : 861, 1997, wherein SAP-1 is designated 
hBD-2. 

Other than the conserved cysteine residues the 
defensin family is quite sequence divergent. It is 
possible that the variant amino acid positions may be 
related to the site or conditions of activity or to the 
spectrum of pathogens attacked by a particular defensin. 

In addition to anti-microbial activities, 
particular defensins exhibit metabolically sensitive 
cytotoxic activity (Lichtenstein et al., Blood 68 : 1407- 
10, 1986 and Sheu et al . ■, Antimicrob. Agents Chemother. 
28: 626-9, 1993), alter the response of adrenal cortical 
cells to ACTH (Zhu. et al., Proc . Natl. Acad. Sci. (USA) 
85(2) : 592-6, 1988) and have specific chemotactic activity 
for human monocytes (Territo et al . , J. Clin. Invest. 
84 (6) : 2017-20, 1989) . Recruitment of monocytes by 
.neutrophils may, in part, be mediated by neutrophilic 
defensins and suggests a pro- inflammatory activity for 
these peptides in addition to their anti-microbial 
effects. Also, a decrease in defensin mRNA level has been 
demonstrated in SPG (specific granule disease) . See, for 
example, Tamura et al., Japan ■ Int . J . Hematol . 59(2) ■■ 
137-42, 1994. Higazi et al . , J. Biol . Chem. 271 (3) : 17650- 
5, 1996, suggested that plasminogen bound to fibrin in the 
presence of defensin may be less susceptible to activation 
by tPA. 

Moieties having anti-microbial, 

immunostimulatory, pro-inflammatory and other properties 
of defensins are sought. The present invention provides 
such polypeptides for these and other uses that should be 
apparent to those skilled in the art from the teachings 
herein. 



SUMMARY OF THE INVENTION 

| Within orte aspect the invention provides an 

isolated protein comprising a polypeptide that is at least 



80% identical to a polypeptide selected from the group 
consisting of: a) a polypeptide having the sequence of 
amino acid residue 1 to amino acid residue 65 of SEQ ID 
NO: 2; b) a polypeptide having the sequence of amino acid 
5 residue 19 to amino acid /residue 65 of SEQ ID NO: 2; c) a 
polypeptide having the sequence of amino acid residue 21 
to amino acid residue 65 of SEQ ID NO: 2; d) a polypeptide 
having the sequence of/ amino acid residue 1 to amino acid 
residue 67 of SEQ Id/nO:10; e) a polypeptide having the 

10 sequence of amino acid residue 21 to amino acid residue 67 
of SEQ ID NO: 10; and f) a polypeptide having the sequence 
of amino acid residue 23 to amino acid residue 67 of SEQ 
ID NO: 2; wherein the polypeptide has cysteine residues 
corresponding to amino acid residues 33, 40, 45, 55, 62 

15 and 63 of SEQ^/ID N0s:2 or 10. Within one embodiment the 
protein comprises a polypeptide having the sequence 
selected from the group consisting of: a) a polypeptide 
having the /sequence of amino acid residue 1 to amino acid 
residue 67 of SEQ ID NO: 10; b) a polypeptide having the 

20 sequence jot amino acid residue 21 to amino acid residue 67 
of SEQ /ID NO: 10; and c) a polypeptide having the sequence 
of amino acid residue 23 to amino acid residue 67 of SEQ 
ID NO: 10. 

Within another aspect is provided an isolated 
25 protein having the sequence of SEQ ID NO: 10 from amino 
acid residue 23 to amino acid residue 67. 

Within another aspect the invention provides a 
polypeptide selected from the group consisting of: .a) 
amino acid residue 30 to amino acid residue 63 of SEQ ID 
30 N0:2; b) amino acid residue 31 to amino acid residue 63 
of SEQ ID NO: 2; c) amino acid residue 30 to amino acid 
residue 64 of SEQ ID N0:2; d) amino acid residue 31 to 
amino acid residue 64 of SEQ ID N0-.2; and e) a polypeptide 
chosen from SEQ ID N0s:14-72. 
35 Within still another aspect is provided a 

pharmaceutical composition comprising a polypeptide 
selected from the group consisting of: a) a protein 



according to claim 1, b) amino acid residue 30 to amino 
acid residue 63 of SEQ ID NO:2; c) amino acid residue 31 
to amino acid residue 63 of SEQ ID NO: 2; d) amino acid 
residue 30 to amino acid residue 64 of SEQ ID NO:2; e) 
amino acid residue 31 to amino acid residue 64 of SEQ ID 
NO: 2; and f) a polypeptide chosen from SEQ ID NOs : 14-72; 
in combination with a pharmaceutical ly acceptable vehicle. 

Within yet another aspect is provided an 
antibody that specifically binds to a protein as described 



10 above. 




Within a further aspect is provided an anti- 
idiotype antibody of an antibody which specifically binds 
to a protein as described above . 

Within another aspect is provided an isolated 
polynucleotide molec/ule encoding a protein, the 
polynucleotide molecule consisting of a coding strand and 
a complementary / non-coding strand, wherein the 
polynucleotide molecule encodes a polypeptide that is at 
least 80% identical to the amino acid sequence to a 
20 polypeptide selected from the group consisting of: a) a 
polypeptide having the sequence of amino acid residue 1 to 
amino acid residue 65 of SEQ ID N0:2; b) a polypeptide 
having the sequence of amino acid residue 19 to amino acid 
residue 65 ok SEQ ID NO: 2; c) a polypeptide having the 
25 sequence of /amino acid residue 21 to amino acid residue 65 
of SEQ ID/NO:2; d) a polypeptide having the sequence of 
amino acid residue 1 to amino acid residue 67 of SEQ ID 
NO: 10; e/) a polypeptide having the sequence of amino acid 
residue/ 21 to amino acid residue 67 of SEQ ID NO:10; and 
30 f) a polypeptide having the sequence of amino acid residue 
23 t)6 amino acid residue 67 of SEQ ID NO: 2; wherein the 
polypeptide has cysteine residues corresponding to amino 
aold residues 33, 40, 45, 55, 62 and 63 of SEQ ID NOs : 2 or 
0 . 

3 5 Within another aspect the invention provides an 

isolated polynucleotide molecule encoding a protein having 
cysteine residues corresponding to amino acid residues 33, 
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40, 45, 55, 62 and 63 of SEQ ID NO:10, the polynucleotide 
molecule consisting of a coding strand and a complementary 
non-coding strand, wherein the polynucleotide comprises a 
nucleotide sequence that is at least 80% identical to the 
5 sequence of a polynucleotide selected from the group 
consisting of: a) a polynucleotide as shown in SEQ ID NO: 9 
from nucleotide 220 to nucleotide 420; b) a polynucleotide 
as shown in SEQ ID NO : 9 from nucleotide 280 to nucleotide 
420; and c) a polynucleotide as shown in SEQ ID NO: 9 from 
=Jl0 nucleotide 286 to nucleotide 420. 

O Within yet another aspect is provided an 

H isolated polynucleotide molecule encoding a protein having 
M. cysteine residues corresponding to amino acid residues 33, 

|? 40, 45, 55, 62 and 63 of SEQ ID NO: 10, the polynucleotide 

■y] 

. 15 molecule consisting of a coding strand and a complementary 
CJ non-coding strand, wherein the polynucleotide comprises a 
S nucleotide sequence as shown in SEQ ID NO: 11. 

I 1 ! Within another aspect is provided an isolated 

JJj polynucleotide molecule encoding a polypeptide selected 
20 from the group consisting of: a) amino acid residue 30 to 
amino acid residue 63 of SEQ ID NO: 2; b) amino acid 
residue 31 to amino acid residue 63 of SEQ ID NO: 2; c) 
amino acid residue 30 to amino acid residue 64 of SEQ ID 
NO: 2; d) amino acid residue 31 to amino acid residue 64 
25 of SEQ ID NO: 2; and e) a polypeptide chosen from SEQ ID 
NOs:14-72. 

The invention also provides an isolated 
polynucleotide molecule selected from the group consisting 
of: a) nucleotide 88 to nucleotide 189 of SEQ ID NO:l; 
30 b) nucleotide 88 to nucleotide 192 of SEQ ID NO-.l; 

c) nucleotide 91 to nucleotide 189 of SEQ ID N0:1; 

d) nucleotide 91 to nucleotide 192 of SEQ ID NO:l; 

e) nucleotide 88 to nucleotide 189 of SEQ ID NO: 4; 

f) nucleotide 88 to nucleotide 192 of SEQ ID NO : 4 ; 

35 g) nucleotide 91 to nucleotide 189 of SEQ ID NO:4 and 
h) nucleotide 91 to nucleotide 192 of SEQ ID NO : 4 . 



Within still another aspect is provided an 
expression vector comprising the following operably linked 
elements: a transcription promoter; a DNA segment 
encoding a polypeptide selected from the group consisting 
5 of : a) a protein of. claim 1; a) amino acid residue 3 0 to 
amino acid residue 63 of SEQ ID NO: 2; b) amino acid 
residue 31 to amino acid residue 63 of SEQ ID NO:2; c) 
amino acid residue 30 to amino acid residue 64 of SEQ ID 
NO: 2; d) amino acid residue 31 to amino acid residue 64 
mlO of SEQ ID NO:2; and e) a polypeptide chosen from SEQ ID 
O NOs:14-72; and a transcription terminator. Within one 

if] 

embodiment the DNA segment further encodes a secretory 
f-i signal sequence operably linked to the protein. Within a 
%l related embodiment the secretory signal sequence is 
, 15 selected from the group consisting of: a) a polypeptide 
having the sequence of amino acid residue 1 to amino acid 
p residue 18 of SEQ ID NO: 2; b) a polypeptide having the 

fJl sequence of amino acid residue 1 to amino acid residue 20 
jjj of SEQ ID N0:2; c) a polypeptide having the sequence of 
20 amino acid residue 1 to amino acid residue 20 of SEQ ID 
NO: 10; and d) a polypeptide having the sequence of amino 
acid residue 1 to amino acid residue 22 of SEQ ID NO: 10. 

Within another aspect the invention provides a 
cultured cell into which has been introduced an expression 
25 vector comprising the following operably linked elements: 
a transcription promoter; a DNA segment encoding a 
polypeptide as described above; and a transcription 
terminator; wherein the cell expresses the protein encoded 
by the DNA segment. 
3 0 Within a further aspect is provided a method of 

producing a' protein comprising: culturing a cell into 
which has been introduced an expression vector comprising 
the following operably linked elements: a transcription 
promoter; a DNA segment encoding a polypeptide as 
35 described above; and a transcription terminator; whereby 
the cell expresses the protein encoded by the DNA segment; 
and recovering the expressed protein. 
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The invention also provides an oligonucleotide 

probe or primer comprising at least 14 contiguous 

nucleotides of a polynucleotide of SEQ ID NO: 11 or a 

sequence complementary to SEQ ID NO: 11. 
5 The invention further provides a method of 

treating a microbial-related disease comprising 
administering to a mammal a therapeutically effective 
amount of a polypeptide selected from the group consisting 
of: a) a polypeptide of SEQ ID NO : 2 ; b) a polypeptide of 
MlO SEQ ID NO: 10; c) a polypeptide chosen from SEQ ID NOs:14- 
S 72; h) amino acid residue 30 to amino acid residue 63 of 
4.1 SEQ ID . NO: 2; i) amino acid residue 31 to amino acid 
residue 63 of SEQ ID NO: 2; j) amino acid residue 30 to 
y! amino acid residue 64 of SEQ ID NO: 2; k) amino acid 

yi 15 residue 31 to amino acid residue 64 of SEQ ID NO:2; 1) 
p amino acid residue 20 to amino acid residue 67 of SEQ ID 

NO:10 and m) amino acid residue 22 to amino acid residue 
67 of SEQ ID NO: 10; whereby said polypeptide ameliorates 
Q said disease. Within one embodiment the microbial-related 
" f 20 disease is associated with the eye. Within a related 
embodiment the microbial-related disease is 

conjunctivitis. Within another embodiment the microbial- 
related disease is associated with the ear. 

Also provided is a method of contraception 
25 comprising administering to a mammal a therapeutically 
effective amount of a polypeptide selected from the group 
consisting of: a) a polypeptide of SEQ ID N0:2; b) a 
polypeptide of SEQ ID NO: 10; c) a polypeptide chosen from 
SEQ ID NOs: 14-72; h) amino acid residue 30 to amino acid 
30 residue 63 of SEQ ID N0:2; i) amino acid residue 31 to 
amino acid residue 63 of SEQ ID NO: 2; j) amino acid 
residue 30 to amino acid residue 64 of SEQ ID NO: 2; k) 
amino acid residue 31 to amino acid residue 64 of SEQ 
ID NO:2; 1) amino acid residue 20 to amino acid residue 67 
35 of SEQ ID NO: 10 and. m) amino acid residue 22 to amino acid 
residue 67 of SEQ ID NO: 10. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the three disulfide bond 
structure of the conserved (3-defensin motif. 

Figure 2 illustrates a multiple alignment of 
5 mature, processed human SAP-1 (see, for example, Bensch et 
al., FEBS Lett . 368 (2) : 331-5, 1995) and the zampl 
polypeptide of the present invention. 

u DETAILED DESCRIPTION OF THE INVENTION 

'. 10 Prior to setting forth the invention in detail, 

~n it may be helpful to the understanding thereof to define 

¥' the following terms. 

% The term "affinity tag" is used herein to denote 

0? a peptide segment that can be attached to a polypeptide to 
!L 15 provide for purification of the polypeptide or provide 
hj sites for attachment of the polypeptide to a substrate. 

C< in principal, any peptide or protein for which an antibody 

q -or other specific binding agent is available can be used 
fU as an affinity tag. Affinity tags include a poly- 
20 histidine tract, protein A (Nilsson et al., EMBO J. 
4:1075, 1985; Nilsson et al . , Methods Enzymol . 198 :3 , 
1991), glutathione S transferase (Smith and Johnson, Gene 
67:31, 1988), substance P, Flag™ peptide (Hopp et al . , 
Biotechnology 6: 1204-1210, 1988; available from Eastman 
25 Kodak Co., New Haven, CT) , streptavidin binding peptide, 
or other antigenic epitope or binding domain. See, in 
general Ford et al . , Protein Expression and Purification 
2: 95-107, 1991. DNAs encoding affinity tags are 

available from commercial suppliers (e.g., Pharmacia 
30 Biotech, Piscataway, NJ) . 

The term "allelic variant" denotes any of two or 
more alternative forms of a gene occupying the same 
chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in phenotypic 
35 polymorphism within populations. Gene mutations can be 
silent (no change in the encoded polypeptide) or may 
encode polypeptides having altered amino acid sequence. 
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The term allelic variant is also used herein to denote a 
protein encoded by an allelic variant of a gene. 

The terms "amino-terminal" and "carboxyl - 
terminal" are used herein to denote positions within 
5 polypeptides and proteins. Where the context allows, 
these terms are used with reference to a particular 
sequence or portion of a polypeptide or protein to denote 
proximity or relative position. For example, a certain 
sequence positioned carboxyl- terminal to a reference 
P,10 sequence within a protein is located proximal to the 

0 carboxyl terminus of the reference sequence, but is not 
J necessarily at the carboxyl terminus of the complete 
M protein . 

The term "complement/anti-complement pair" 

1 15 denotes non-identical moieties that form a non-covalently 

associated, stable pair under appropriate conditions. For 
p instance, biotin and avidin (or streptavidin) are 
prototypical members of a complement/anti-complement pair, 
pj Other exemplary complement/anti-complement pairs include 

20 receptor /ligand pairs, antibody/antigen (or hapten or 
epitope) pairs, sense/antisense polynucleotide pairs, and 
the like. Where subsequent dissociation of the 

complement/ ant i- complement pair is desirable, the 
complement /ant i- complement pair preferably has a binding 

25 affinity of <10 9 M 1 . 

The term "complements of polynucleotide 
molecules" denotes polynucleotide molecules having a 
complementary base sequence and reverse orientation as 
compared to a reference sequence. For example, the 

30 sequence 5' ATGCACGGG 3' is complementary to 5' CCCGTGCAT 
3 ' . 

The term "degenerate nucleotide sequence" 
denotes a sequence of nucleotides that includes one or 
more degenerate codons (as compared to a reference 
35 polynucleotide molecule that encodes a polypeptide) . 
Degenerate codons contain different triplets of 
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nucleotides, but encode the same amino acid residue (i.e., 
GAU and GAC triplets each encode Asp) . 

The term "expression vector" denotes a DNA 
molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to 
additional segments that provide for its transcription. 
Such additional segments may include promoter and 
terminator sequences, and may optionally include one or 
more origins of replication, one or more selectable 
markers, an enhancer, a polyadenylation signal, and the 
like. Expression vectors are generally derived from 
plasmid or viral DNA, or may contain elements of both. 

The term "isolated", when applied to a 
polynucleotide molecule, denotes that the polynucleotide 
has been removed from its natural genetic milieu and is 
thus free of other extraneous or unwanted coding 
sequences, and is in a form suitable for use within 
genetically engineered protein production systems. Such 
isolated molecules are those that are separated from their 
natural environment and include cDNA and genomic clones. 
Isolated DNA molecules of the present invention are free 
of other genes with which they are ordinarily associated, 
but may include naturally occurring 5' and 3' untranslated 
regions such as promoters and terminators. The 
identification of associated regions will be evident to 
one of ordinary skill in the art (see for example, Dynan 
and Tijan, Nature 316:774-78, 1985). When applied to a 
protein, the term "isolated" indicates that the protein is 
found in a condition other than its native environment, 
such as apart from blood and animal tissue. In a 
preferred form, the isolated protein is substantially free 
of other proteins, particularly other proteins of animal 
origin. It is preferred to provide the protein in a 
highly purified form, i.e., greater than 95% pure, more 
preferably greater than 99% pure. 

The term "operably linked", when referring to 
DNA segments, denotes that the segments are arranged so 
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that they function in concert for their intended purposes, 
e.g. transcription initiates in the promoter and proceeds 
through the coding segment to the terminator. 

The term "ortholog" denotes a polypeptide or 
protein obtained from one. species that is the functional 
counterpart of a polypeptide or protein from a different 
species. Sequence differences among orthologs are the 
result of speciation. 

The term "ortholog" denotes a polypeptide or 
protein obtained from one species that is the functional 
counterpart of a polypeptide or protein from a different 
species. Sequence differences among orthologs are the 
result of speciation. 

The term "polynucleotide" denotes a single- or 
double-stranded polymer of deoxyribonucleotide or 
ribonucleotide bases read from the 5' to the 3' end. 
Polynucleotides include RNA and DNA, and may be isolated 
from natural sources, synthesized in vitro, or prepared 
from a combination of natural and synthetic molecules. 
Sizes of polynucleotides are expressed as base pairs 
(abbreviated "bp"), nucleotides <"nt"), or kilobases 
("kb") . Where the context allows, the latter two terms 
may describe polynucleotides that are single-stranded or 
double-stranded. When the term is applied to double- 
stranded molecules it is used to denote overall length and 
will be understood to be equivalent to the term "base 
pairs". It will be recognized by those skilled in the 
art that the two strands of a double- stranded 
polynucleotide may differ slightly in length and that the 
ends thereof may be staggered as a result of enzymatic 
cleavage,- thus all nucleotides within a double-stranded 
polynucleotide molecule may not be paired. Such unpaired 
ends will in general not exceed 20 nt in length. 

A "polypeptide" is a polymer of amino acid 
residues joined by peptide bonds, whether produced 
naturally or synthetically. Polypeptides of less than 
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about 10 amino acid residues are commonly referred to as 
"peptides" . 

The term "promoter" denotes a portion of a gene 
containing DNA sequences that provide for the binding of 
5 RNA polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' 
non-coding regions of genes. 

The term "receptor" denotes a cell-associated 
protein that binds to a bioactive molecule (i.e., a 
QlO ligand) and mediates the effect of the ligand on the cell. 
Membrane -bound receptors are characterized by a multi- 
domain structure comprising an extracellular ligand- 
H binding domain and an intracellular effector domain that 

g\ is typically involved in signal transduction. Binding of 
s 15 ligand to receptor results . in a conformational change in 
!*; the receptor that causes an interaction between the 
□ effector domain and other molecule (s) in the cell. This 
interaction in turn leads to an alteration in the 
fjj metabolism of the cell. Metabolic events that are linked 
20 to receptor-ligand interactions include gene 

transcription, phosphorylation, dephosphorylation, 

increases in cyclic AMP production, mobilization of 
cellular calcium, mobilization of membrane lipids, cell 
adhesion, hydrolysis of inositol lipids and hydrolysis of 
25 phospholipids. Most nuclear receptors also exhibit a 
multi-domain structure, including an amino- terminal, 
transactivating domain, a DNA binding domain and a ligand 
binding domain. In general, receptors can be membrane 
bound, cytosolic or nuclear; monomeric (e.g., thyroid 
30 stimulating hormone receptor, beta-adrenergic receptor) or 
multimeric (e.g., PDGF receptor, growth hormone receptor, 
IL-3 receptor, GM-CSF receptor, G-CSF receptor, 
erythropoietin receptor and IL-6 receptor) . 

The term "secretory signal sequence" denotes a 
35 DNA sequence that encodes a polypeptide (a "secretory 
peptide") that, as a component of a larger polypeptide, 
directs the larger polypeptide through a secretory pathway 
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of a cell ■ in which it is synthesized. The larger peptide 
is commonly cleaved to remove the secretory peptide during 
transit through the secretory pathway. 

A "soluble receptor" is a receptor polypeptide 
5 that is not bound to a cell membrane. Soluble receptors 
are most commonly ligand-binding receptor polypeptides 
that lack transmembrane and cytoplasmic domains. Soluble 
receptors can comprise additional amino acid residues, 
such as affinity tags that provide for purification of the 

10 polypeptide or provide sites for attachment of the 
polypeptide to a substrate, or immunoglobulin constant 
region sequences. Many cell-surface receptors have 

naturally occurring, soluble counterparts that are 
produced by proteolysis or translated from alternatively 

15 spliced mRNAs . Receptor .polypeptides are said to be 
substantially free of transmembrane and intracellular 
polypeptide segments when they lack sufficient portions of 
these segments to provide membrane anchoring or signal 
transduction, respectively. 

20 Molecular weights and lengths of polymers 

determined by imprecise analytical methods (e.g., gel 
electrophoresis) will be understood to be approximate 
values. When such a value is expressed as "about" X or 
"approximately" X, the stated value of X will be 

25 understood to be accurate to "±10% . 

All references cited herein are incorporated by 
reference in their entirety. 

The present invention is based in part upon the 
discovery of a novel DNA sequence that encodes a 

30 polypeptide having homology to proteins of the p-defensin 
family. That is, the zampl polypeptides of the present 
invention exhibit a conserved motif shown in SEQ ID NO: 3 
and herein: C (X) S C (X) ,C (X) ,GXC (X) S CC wherein "(X)" is the 
number of preferably non-cysteine amino acid residues 

35 between specific amino acids. The cysteine position and 
spacing is characteristic of the p-defensin family. In 
addition, the QIG tripeptide motif embedded in the 
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conserved motif shown in SEQ ID NO: 3 occurs in several 
members of the P-defensin family (for example, SAP-l/hBD- 
2, BNBDs, TAP and the like) . This motif is interpreted to 
indicate the presence of three disulfide bonds in the 0- 
5 defensin structure. Those disulfide bonds are shown in 
Fig. 1. In addition, an intron sequence of approximately 
900 base pairs is found in genomic DNA sequence encoding 
the zampl polypeptide. This intron sequence is inserted 
between the two guanine residues in the codon encoding the 
h»10 glycine residue at amino acid position 20 in SEQ ID NO: 10. 
jjj Such intron placement, in the area between the signal 
vQ sequence and the mature protein occurs in other members of 

the P-defensin family. 
Hi A standard Northern blot tissue distribution of 

uf 15 the mRNA corresponding to. this novel DNA revealed no 
p« expression. It thus appears that normal tissue levels of 
Ul mRNA of zampl polypeptide are below the detection 
sensitivity of the Northern blot. Such an observation is 
p consistent with the knowledge in the art regarding 
•20 defensins, i.e. , that they are constitutively expressed at 
low levels but are highly inducible upon infection. 
Electronic analysis of tissue distribution based upon 
libraries where the sequence is found indicate that zampl 
polypeptide is expressed in bronchial epithelia. 
25 The novel zampl polypeptides of the present 

invention were initially identified by querying an EST 
database for homologous sequences to the SAP-1 human 
defensin isolated from human psoriatic skin. A single EST 
sequence was discovered in a bronchial epithelium cDNA 
30 library and was predicted to be related to the p-defensin 
family. A second search based upon the p-defensin 
consensus motif also identified the EST. Thus, the 
consensus motif is found in the zampl polypeptide as well 
as in the SAP-1 protein; however, the remaining sequence 
35 of the two proteins is divergent, characterized by 
approximately 43% identity at the amino acid level. See, 
for example, the multiple alignment shown in Fig. 2. 
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The nucleotide sequence of the zampl -polypeptide 
is described in SEQ ID N0:1 and SEQ ID NO: 9, and its 
deduced amino acid sequence is described in SEQ ID NO: 2 
and SEQ ID NO:10, respectively. The zampl polypeptide, by 
sequence analysis, can be grouped with the two human P~ 
defensins, hBD-1 and hBD-2 (SAP-1) , but. it is most closely 
sequence-related to hBD-2 and the bovine BNBDs and less 
similar to hBD-1. 

Preliminary computer-aided model building 
efforts to construct a three-dimensional model structure 
for zampl polypeptide indicate that it is feasible to 
generate physically reasonable model structures using 
BNBD_12 (Zimmermann et al., Biochemistry 34 (41) : 13663- 
13671, 1995) as a template. Although there is relatively 
low sequence identity between these two peptides, their 
overall secondary structure is very similar. The most 
variability is observed in the loop regions, which is not 
alarming since the loop segments represent one of many 
possible conformations for each loop. Both structures are 
built primarily of an anti-parallel beta sheet core, four- 
stranded in the BNBD_12 model and three-stranded in the 
zampl polypeptide model. A turn formed between two of the 
beta strands in the BNBD_12 chain is also found in the 
zampl polypeptide model connecting two beta strands. The 
overall folding of BNBD_12 follows the pattern of beta 
strand/short beta strand/short beta strand/turn/beta 
strand. Folding of zampl polypeptide consists of beta 
strand/beta strand/turn/beta strand. These common 

structural elements are highly superimposable . Thus, the 
two polypeptides may well be involved in the same or 
similar biological processes. 

Another aspect of the present invention includes 
zampl polypeptide fragments. Preferred fragments include 
the leader sequence, ranging from amino acid 1 (lie) to 
amino acid 18 (Gly) or 20 (Gly) of SEQ ID NO: 2 and ranging 
from amino acid 1 (Met) to amino acid 20 (Gly) or 22 (Gly) 
of SEQ ID NO: 10. Such leader sequences may be used to 
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direct the secretion of other polypeptides. Such 
fragments of the present invention may be used as follows: 
the alternative secretion leader fragments are formed as 
fusion proteins with alternative proteins selected for 
secretion; plasmids bearing regulatory regions capable of 
directing the expression of the fusion protein are 
introduced into test cells; and secretion of the protein 
is monitored. 

Additional preferred fragments include the (}- 
hairpin loop of the zampl polypeptide, amino acid 30 (Lys) 
or 31 (Tyr) to amino acid 63 (Cys) or amino acid 64 (Arg) 
of SEQ ID NO:2, SEQ ID N0:14, and SEQ ID NOs:18-34. Also 
provided are the polypeptides of SEQ ID NO: 15, SEQ ID 
NO:16, SEQ ID NO : 17 , SEQ ID NOs:35-72, that eliminate the 
structural complications presented by the free cysteine 
residues (amino acids 55 and 62 of SEQ ID NO: 2) and 
maintain the hydrophobic face of the polypeptide. The 
fragments provided by the invention would be useful in 
applications, described herein, where administration of 
zampl would be beneficial, such as anti-microbial agents 
(Thennarasu and Nagaraj , Biochem. Biophys. Res. Comm . 
254.:281-3, 1999) . 

The present invention also provides fusion 
constructs incorporating the zampl polypeptide selected 
from the group consisting of: (a) polypeptide molecules 
comprising a sequence of amino acid residues as shown in 
SEQ ID NO:2 from amino acid residue 1 (lie), 19 (His) or 
21 (Gly) to amino acid residue 65 (Lys) or a sequence of 
amino acid residues as shown in SEQ ID NO: 10 from amino 
acid residue 1 (Met) , 21 (His) or 23 (Gly). to amino acid 
residue 67 (Lys) ; or (b) mammalian species homologs or 
human paralogs of (a) ; at least the mature polypeptide 
region of another defensin molecule; and, optionally, a 
polypeptide linker there between. When defensin molecules 
having disparate spectrum of pathogens, fusion constructs 
containing the same are expected to exhibit a broader 
range of anti-microbial effectiveness. Polypeptide 
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linkers are preferably employed if necessary to provide 
separation of component polypeptides of the fusion or to 
allow for flexibility of the fusion protein, thereby 
preserving the anti -microbial activity of each defensin 
5 component of the fusion protein. Those of ordinary skill 
in the art are capable of designing such linkers. 

The highly conserved amino acids in the 
consensus domain of zampl polypeptide can be used as a 
^ tool to identify new family members. For instance, 
ClO reverse transcript ion-polymerase chain reaction (RT-PCR) 

o 

can be used to amplify sequences encoding the conserved 
{"» motif from RNA obtained from a variety of tissue sources. 
IT More specifically, the following probes can be employed to 
O's identify other human or zampl-like [3-def ensms . A 
^15 preferred embodiment of this aspect of the present 
[a invention ranges between amino acid residues 31 and 61 of 
: SEQ ID NO: 2 (corresponding to nucleotides 91-183 of SEQ 

ID NO: 1). In particular, highly degenerate primers 
flj designed from the above sequences are useful for this 
20 purpose. 

SEQ ID NO: 4 is a degenerate polynucleotide 
sequence that encompasses all polynucleotides that encode 
the zampl polypeptide of SEQ ID NO: 2 (amino acids 1-65) . 
SEQ ID NO: 11 is a degenerate polynucleotide sequence that 

25 encompasses all polynucleotides that encode the zampl 
polypeptide of SEQ ID NO: 10. Thus, zampl polypeptide- 
encoding polynucleotides ranging from nucleotide 1, 61 or 
67 to nucleotide 195 or 213 of SEQ ID NO: 4 and ranging 
from nucleotide 1, 61 or 67 to nucleotide 201 or 219 of 

30 SEQ ID NO: 11 are contemplated by the present invention. 
Also contemplated by the present invention are fragments 
and fusions as described above with respect to SEQ ID NO: 
1 and SEQ ID NO: 10, which are formed from analogous 
regions of SEQ ID NO: 4 and SEQ ID NO: 11. The symbols in 

35 SEQ ID NO: 4 are summarized in Table 1 below. 
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The degenerate codons used in SEQ ID NO: 4 and 
SEQ ID NO: 11, encompassing all possible codons for a 
given amino acid, are set forth in Table 2 below. 



20 



TABLE 2 





Amino 
Acid 


Letter 


Codons 


Degenerate 
Codon 




Cys 


c 


TGC TGT 


TGY 




Ser 


s 


ARC AGT "IT A T(T TRR TTT 


WOIV 




Thr 


j 


ARA Arr ARG ACT 


ARM 




Pro 


p 


RRA CCC RRG CCJ 


RRN 




Ala 
m i a 


A 


rra grr grg rrt 

\JL>r\ uijU vULiU uVj j 
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u i y 
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Lj>Jr\ UIUi-i uuu uu 1 
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U II 1 
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Lys 




AAA AAG 


AAR 




Met 


M 


ATG 


ATG 




Tip 


T 
I 


ATA ATR ATT 


ATH 




Leu 


R_ 


CTA RTR RTR. TTT TTA TTG 


YTN 




Val 




RTA GTR RTR RTT 


RTN 




Phe 


F 


TTC TTT 


TTY 




Tyr 


Y 


TAC TAT 


TAY 




Trp 


W 


TGG 


TGG 




Ter 




TAA TAG TGA 


TRR 




Asn | Asp 


B 




RAY 




G1u|Gln 


Z 




SAR 




Any 


X 




NNN 




Gap 









One of ordinary skill in the art will appreciate 
5 that some ambiguity is introduced in determining a 
degenerate codon, representative of all possible codons 
encoding each amino acid. For example, the degenerate 
codon for serine (WSN) can, in some circumstances, encode 
arginine (AGR) , and the degenerate codon for arginine 
10 (MGN) can, in some circumstances, encode serine (AGY) . A 
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similar relationship exists between codons encoding 
phenylalanine and leucine. Thus, some polynucleotides 
encompassed by the degenerate sequence may encode variant 
amino acid sequences, but one of ordinary skill in the art 
5 can easily identify such variant sequences by reference to 
the amino acid sequence of SEQ ID NO: 2 and SEQ ID NO: 10. 
Variant sequences can be readily tested for functionality 
as described herein. 
y. Within another aspect of the present invention 

Cjio there is provided a pharmaceutical, composition comprising 
3 purified zampl polypeptide in combination with a 
f° pharmaceutically acceptable vehicle. Such pharmaceutical 
compositions are used in the treatment of conditions 
ffi associated with pathological microbes, including 
%-,15 bacterial, fungal and viral infections. Antibacterial 
W applications of zampl polypeptide include situations where 
h! the pathogen has become resistant to standard treatments. 
Q For example, hospital sepsis is an increasing problem, 
• iJ since Staphylococcus strains have become resistant to 
20 commonly used antibiotics. 

In general, anti-microbial activity of zampl 
polypeptides, fragments, fusions, antibodies, agonists and 
antagonists can be evaluated by techniques that are known 
in the art. More specifically, anti-microbial activity 
25 can be assayed by evaluating the sensitivity of microbial 
cell cultures to test agents and by evaluating the 
protective effect of test agents on infected mice. See, 
for example, Musiek et al . , Antimicrob. Agents Chemothr. 
3.: 40, 1973. Antiviral activity can also be assessed by 
30 protection of mammalian cell cultures. Known techniques 
for evaluating anti-microbial activity include, for 
example, Barsum et al . , Eur. Respir. J. 8(5) : 709-14, 
1995; Sandovsky-Losica et al . , J. Med. Vet. Mvcol 
(England) 28 (4) : 279-87, 1990; Mehentee et al., J. Gen. 
35 Microbiol (England) 13 5 (Pt. 8) : 2181-8, 1989; Segal and 
Savage, Journal of Medical and Veterinary Mycology 24 : 
477-479, 1986 and the like. Known assays specific for 
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anti-viral activity include, for example, those described 
by Daher et al . , J. Virol. 60(3) : 1068-74, 1986. 

In addition, contract laboratories offer 
services in evaluating anti -microbial properties. For 
5 example, Panlabs, Inc. of Bothell, Washington offer in 
vitro or in vivo testing for bacteria, gram negative 
(Enterobacter cloacae, Escherichia coli, Klebsiella 
pneumonia, Proteus vulgaris, Pseudomonas aeruginosa, 
Salmonella typhimurium and Serratia marcescens) , gram 
ao positive (Bacillus subtilis , Brevebacterium ammoniagenes , 
Corynebacterium minutissimum, ■. Micrococcus luteus, 
I Mycobacterium ranae. Staphylococcus strains and 
Streptococcus strains) and anaerobic organisms 
(.Actinomyces viscosus, Bacteroides fragilis, Clostridium 
15 sporogenes, Corynebacterium. acnes, Helicobacter pylori and 
Porphyromonas gingivalis) , as well as for protozoa 
j (Trichomonas foetus) and fungi ( e.g. . Candida albicans, 
Epidermophyton floccosum, Exophiala jeanselmei, 

Microsporum strains. Trichophyton strains and the like) . 
20 Also, Molecular Probes of Oregon has commercially 
available fluorescence technology for use in bacteriology. 

If desired, zampl polypeptide, fragment, fusion 
protein, agonist, antagonist or antibody performance in 
this regard can be compared to proteins known to be 
25 functional in this regard, such as proline-rich proteins, 
lysozyme, histatins, lactoperoxidase or the like. In 
addition, zampl polypeptide, fragment, fusion protein, 
antibody, agonist or antagonist may be evaluated in 
combination with one or more anti-microbial agents to 
30 identify synergistic effects. 

Defensins have been found associated with the 
tissues and secretions of the human eye and are useful in 
treating microbial-related diseases of the eye (Cullor et 
al., Arch. Ophthalmol . 108 :661-4, 1990; Murphy et al . , US 
35 Patent No. 5,242,902; Hattenbach et al . , Antimicrob . 
Agent. Chemother . 42.: 332, 1998; Haynes et al . , Lancet 
354:451-2, 1998 and Haynes et al . , Br. J. Ophthalmol . 
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83:737-41, 1999). Addition of defensin polypeptides 
reduced microbial contamination of corneal preservation 
medium. Such agents are useful for reducing infectious 
postoperative complications associated with such 
transplantations (Schwab et al . , Cornea 11:370-5, 1992) . 

Thus, zampl polypeptides, agonists or 
antagonists thereof may be therapeutically useful for 
treatment of infection and inflammation associated with 
the eye. To verify the presence of this capability in 
zampl polypeptides, agonists or antagonists of the present 
invention, such zampl polypeptides, agonists or 
antagonists are evaluated with respect to their anti- 
microbial and chemotactic activities according to 
procedures known in the art. If desired, zampl 

polypeptide performance in this regard can be compared to 
other a and (S defensins, rabbit neutrophil defensins and 
the like. In addition, zampl polypeptides or agonists or 
antagonists thereof may be evaluated in combination with 
one or more anti -microbial molecules to identify 

synergistic effects. 

Similarly, zampl polypeptides of the present 

invention would also be therapeutically useful for 
treatment of infection and inflammation associated with 
the ear. The invention provides a method of treating a 
microbial-related disease comprising administering to a 
mammal a therapeutically effective amount of a zampl 
polypeptide whereby the polypeptide ameliorates the 
disease. Preferably such polypeptides include the P- 
hairpin loop of the zampl polypeptide, amino acid 30 (Lys) 
or 31 (Tyr) to amino acid 63 (Cys) or amino acid 64 (Arg) 
of SEQ ID NO:2, SEQ ID NO : 14 and SEQ ID NOs:18-34. 
Additional exemplary polypeptides include the polypeptides 
of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17 and SEQ ID 
NOs: 35-72. One such application would be for microbial- 
related disease is associated with the eye, such as 
conjunctivitis. Application could also be made for 
diseases associated with the ear. 
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The invention also provides a method of 
contraception wherein a therapeutically effective amount 
of the polypeptides of the present invention is 
administered to a mammal. Such administration would be 
5 useful in preventing implantation and/or development of 
the embryo . 

Defensins have also been found to activate the 
complement cascade (Prohaszka et al . , Mol Immunol . 34 : 809- 
16, 1997 and Prohaszka and Fust, Lancet 352 : 1152 ■ 1998) 
j=10 The complement component Clq plays a role in host defense 
p against infectious agents, such as bacteria and viruses, 
ill Clq is known to exhibit several specialized functions. 

For example, Clq triggers the complement cascade via 
En interaction with bound antibody or C-reactive protein 
U "15 (CRP) . Also, Clq interacts directly with certain 

Pi bacteria, RNA viruses, mycoplasma, uric acid crystals, the 
|*| lipid A component of bacterial endotoxin and membranes of 

: certain intracellular organelles. Clq binding to the Clq 

D receptor is believed to promote phagocytosis . Clq also 

iif 20 appears to enhance the antibody formation aspect of the 
host defense system. See, for example, Johnston, Pediatr . 
Infect. Pis. J. 12: 933-41, 1993. Thus, complement- 
activating defensins would act as enhanced anti-microbial 
agents, promoting lysis or phagocytosis of infectious 
25 agents. 

The anti-microbial activity of defensins is also 
useful for contraceptive applications (Sawicki and 
Mystkovska, Lancet 353:464-5, 1999) . 

The pharmaceutical compositions of the present 

30 invention may also be used when pro-inflammatory activity 
is desired. Applications for such pro- inflammatory 

activity include the treatment of chronic tissue damage, 
particularly in areas having a limited or damaged vascular 
system, e.g., damage in extremities associated with 

35 diabetes. In contrast, antagonists to zampl polypeptides 
may be useful as anti- inflammatory agents. 
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Zampl polypeptide pharmaceutical compositions, of 
the present invention may also be used in the treatment of 
conditions where stimulation of immune responsiveness is 
desired. Such conditions include the treatment of 

5 patients having incompetent immune systems, such as AIDs 
patients or individuals that have undergone chemotherapy, 
radiation treatment or the like. 

Because zampl polypeptide was found in a 
bronchial epithelia library and cystic fibrosis is 

10 characterized by frequent microbial infection, 
pharmaceutical compositions containing zampl -polypeptide 
are also contemplated for use in the treatment of lung 
infections associated with cystic fibrosis. Also 
contemplated by the present invention are engineered zampl 

15 polypeptides that are characterized by decreased 
sensitivity to salt concentration. Decreased sensitivity 
to high salt concentration will preserve anti -microbial 
activity of engineered zampl polypeptides in high salt 
environments, such as in the lung airways of patients 

20 suffering from cystic fibrosis. In this manner, 

pharmaceutical compositions containing engineered zampl 
polypeptides that are formulated for delivery to the lungs 
can be used to treat lung infections associated with 
cystic fibrosis. 

25 Radiation hybrid mapping is a somatic cell 

genetic technique developed for constructing high- 
resolution, contiguous maps of mammalian chromosomes (Cox 
et al., Science 250 :245-50, 1990). Partial or full 
knowledge of a gene's sequence allows one to design PCR 

30 primers suitable for use with chromosomal radiation hybrid 
mapping panels. Commercially available radiation hybrid 
mapping panels which cover the entire human genome, such 
as the Stanford G3 RH Panel and the GeneBridge 4 RH Panel 
(Research Genetics, Inc., Huntsville, AL) , are available. 

35 These panels enable rapid, PCR-based chromosomal 
localizations and ordering of genes, sequence -tagged sites 
(STSs) , and other nonpolymorphic and polymorphic markers 
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within a region of interest. This includes establishing 
directly proportional physical distances between newly 
discovered genes of interest and previously mapped 
markers. The precise knowledge of a gene's position can 
5 be useful for a number of purposes, including: 1) 
determining if a sequence is part of an existing contig 
and obtaining additional surrounding genetic sequences in 
various forms, such as YACs, BACs or cDNA clones; 2) 
providing a possible candidate gene for an inheritable 

10 disease which shows linkage to the same chromosomal 
region; and 3) cross-referencing model organisms, such as 
mouse, which may aid in determining what function a 
particular gene might have. 

The results showed that the zampl gene maps 33.5 

15 cR_3000 from the top of the human chromosome 8 linkage 
group on the WICGR radiation hybrid map. Proximal and 
distal framework markers were CHLC . GATA6 2D1 0 and WI-3823 
(D8S1511) , respectively. The use of surrounding markers 
positions the zampl gene in the 8p23.3-p23.2 region on the 

20 integrated LDB chromosome 8 map (The Genetic Location 
Database, University of Southhampton, WWW server: 
http : //cedar . genetics . soton.ac.uk/ public_html/) . 

Previously, human defensin genes of both 
hematopoietic (such as HD-1, as described by Sparkes et 

25 al., Genomics 5 (2) : 240-4, 1989) and epithelial (such as 
HD-5 and HD-6, as described by Bevins et al.. Genomics 
31 ID : 95-106, 1996) origin are localized on the short arm 
of human chromosome 8 (8p23) . Several defensin genes, 
cryptidins, have been mapped in the mouse genome and are 

30 found in a region of conserved synteny with human on mouse 
chromosome 8. See, for example, Ouellette et al . , 
Genomics 5(2) : 233-9, 1989. Recently, Liu et al . , 

Genomics 43 (3) : 316-20, 1997, reported the mapping of the 
hBD_l gene to the same cluster of defensins on chromosome 

35 8. These authors propose that a- and (5-defensin genes 
arose from a common ancestral gene prior to mammalian 
divergence. Thus, the localization of the zampl 
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polypeptide-encoding gene to this region of chromosome 8 
adds a second human P-defensin to the same chromosomal 
location as the human classic defensins and supports the 
hypothesis for the evolution of defensins. 
5 The present invention also provides reagents 

which will find use in diagnostic applications. For 
example, the zampl gene, a probe comprising zampl DNA or 
RNA or a subsequence thereof can be used to determine if 
f»k the zampl gene is present on chromosome 8 or if a mutation 

'~i 10 has occurred. Detectable chromosomal aberrations at the 

u 

ill zampl gene locus include but are not limited to 

aneuploidy, gene copy number changes, . insertions, 
if! deletions, restriction site changes and rearrangements. 

Such aberrations can be detected using polynucleotides of 
!U 15 the present invention by employing molecular genetic 
y techniques, such as restriction fragment length 

"ri polymorphism (RFLP) analysis, short tandem repeat (STR) 

ul 

p analysis employing PCR techniques, and other genetic 

ttl linkage analysis techniques known in the art (Sambrook et 

20 al . , ibid . ; Ausubel, et. al . , ibid . ; Marian, Chest , 108 : 

255-265, 1995) . 

Another aspect of the present invention involves 

the detection of zampl polypeptides in cell culture or in 

a serum sample or tissue biopsy of a patient undergoing 

25 evaluation for SPG, Chediak-Higashi syndrome or other 
conditions characterized by an alteration in defensin 
concentration. Zampl polypeptides can be detected using 
immunoassay techniques and antibodies capable of 
recognizing a zampl polypeptide epitope. More 

30 specifically, the present invention contemplates methods 
for detecting zampl polypeptide comprising: 

exposing a solution or sample or cell culture 
lysate or supernatant, possibly containing zampl 
polypeptide, to an antibody attached to a solid support, 

35 wherein said antibody binds to a first epitope of a zampl 
polypeptide; 
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washing said immobilized antibody-polypeptide to 

remove unbound contaminants ,- 

exposing the immobilized antibody-polypeptide to 

a second antibody directed to a second epitope of a zampl 
5 polypeptide, wherein the second antibody is associated 

with a detectable label; and 

detecting the detectable label. Zampl 

polypeptide concentration differing from that of controls 

may be indicative of SPG, Chediak-Higashi syndrome or 
Ui 10 other conditions characterized by an alteration in 
JP defensin concentration. In addition, expression of zampl 

jp may be monitored in cystic fibrosis patients as a 

predictor of the onset of infectious crises. Also, high 
m defensin, such as zampl polypeptide, levels have been 

0' 15 associated with cytotoxic effects in lung, indicating that 
jU zampl polypeptide levels can be used to direct treatment 

hi for averting or addressing such cytotoxicity. For 

jtj example, antibodies directed to zampl polypeptide can be 

£j administered to inactivate the same in a treatment 

FU 20 modality. 

Within additional aspects of the invention there 
are provided antibodies or synthesized binding 
proteins ( e.g. , those generated by phage display, E. coli 
Fab, and the like) that specifically bind to the zampl 

25 polypeptides described above. Such antibodies" are useful 
for, among other uses as described herein, preparation of 
anti- idiotypic antibodies. Synthesized binding proteins 
may be produced by phage display using commercially 
available kits, such as the Ph.D.™ Phage Display Peptide 

30 Library Kits available from New England Biolabs, Inc. 
(Beverly, Massachusetts) . Phage display techniques are 
described, for example, in US Patent Nos . 5,223,409, 
5,403,484 and 5,571,698. 

An additional aspect of the present invention 

35 provides methods for identifying agonists or antagonists 
of the zampl polypeptides disclosed above, which agonists 
or antagonists may have valuable properties as discussed 
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further herein. Within one embodiment, there is provided 
a method of identifying zampl polypeptide agonists, 
comprising providing cells responsive thereto, culturing 
the cells in the presence of a test compound and comparing 
5 the cellular response with the cell cultured in the 
presence of the zampl polypeptide, and selecting the test 
compounds for which the cellular response is of the same 
type. 

Within another embodiment, there is provided a 
ll 10 method of identifying antagonists of zampl polypeptide, 
y comprising providing cells responsive to a zampl 

jti polypeptide, culturing a first portion of the cells in the 

r* presence of zampl polypeptide, culturing a second portion 

ifs of the cells in the presence of the zampl polypeptide and 

01 15 a test compound, and detecting a decrease in a cellular 
L response of the second portion of the cells as compared' to 

the first portion of the cells. 
*•:! A further aspect of the invention provides a 

?{ method of studying chemoattraction of monocytes in cell 

ry 20 culture, comprising incubating monocytes in a culture 
medium comprising a zampl polypeptide, fragment, fusion 
protein, antibody, agonist or antagonist to study or 
evaluate monocyte chemoattraction. Such evaluation may be 
conducted using methods known in the art, such as those 
25 described by Territo et al . referenced above. 

Melanocortin receptors are G-coupled protein 
receptors which activate adenylate cyclase and cause 
calcium flux. The agouti protein (which contains a 3 6 
amino acid domain that is toxin-like) is thought to 
30 inhibit the binding of MSH-alpha to MCI and MC4 . In 
addition, the agouti protein is thought to be an 
antagonist of calcium channels, and certain toxins are 
believed to modulate ion flux. Experimental evidence has 
been generated, suggesting that defensins are capable of 
3 5 blocking calcium channels. 

A further aspect of the invention provides a 
method of studying activity of the melanocortin family of 
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receptors in cell culture, comprising incubating cells 
that endogenously bear such receptors ( e.g. , ACTH 
receptors or the like) or cells that have been engineered 
to bear such receptors in a culture medium comprising a 
5 ligand or putative ligand and zampl polypeptide, fragment, 
fusion protein, antibody, agonist or antagonist to study 
or evaluate ligand or putative ligand binding and/or ion 
flux regulation or modulation. Such evaluation may be 
conducted using methods known in the art, such as those 

10 described by Zhu et al . referenced above. 

A further aspect of the invention provides a 
method of studying ion flux in cell culture, comprising 
incubating cells that are capable of ion flux, such as 
calcium flux, sodium flux, potassium flux or the like, in 

15 a culture medium comprising zampl polypeptide, fragment, 
fusion protein, antibody, agonist or antagonist to study 
or evaluate ion flux regulation or modulation. 

A further aspect of the invention provides a 
method of studying cytocidal activity against mammalian 

20 cells, such as tumor cells, in cell culture, comprising 
incubating such cells in a culture medium comprising a 
zampl polypeptide, fragment, fusion protein, antibody, 
agonist or antagonist at high . test agent and low cell 
concentration to study or evaluate cytocidal activity. 

25 Such evaluation may be conducted using methods known in 
the art, such as those described by Lichtenstein et al., 
Blood 68 : 1407-10, 1986 and Sheu et al . , Antimicrob. 
Agents Chemother. 28 : 626-9, 1993. 

Another aspect of the present invention involves 

3 0 the use of zampl polypeptides, fragments, fusion proteins 
or agonists as cell culture reagents in in vitro studies 
of exogenous microorganism infection, such as bacterial, 
viral or fungal infection. Such moieties may also be used 
in in vivo animal models of infection. 

35 An additional aspect of the present invention is 

to study epithelial cell defensin induction in cell 
culture. In this aspect of the present invention, 



epithelial cells are cultured and exposed to pathogenic 
stimuli. Induction of zampl polypeptide production by the 
epithelial cells is then measured. 

Within preferred embodiments of the invention 
the isolated polynucleotides will hybridize to similar 
sized regions of SEQ ID NO: 2, SEQ ID N0:3, SEQ ID NO: 10, 
other probe sequences specifically set forth herein, or a 
sequence complementary thereto, under stringent 
conditions. In general, stringent conditions are selected 
to be about 5°C lower than the thermal melting point (T m ) 
for the specific sequence at a defined ionic strength and 
pH. The T m is the temperature (under defined ionic 
strength' and pH) at which 50% of the target sequence 
hybridizes to a perfectly matched probe. Typical 
stringent conditions are- those in which the salt 
concentration is up to about 0.03 M at pH 7 and the 
temperature is at least about 6 0°C. 

As previously noted, the isolated 

polynucleotides of the present invention include DNA and 
RNA. Methods for isolating DNA and RNA are well known in 
the art. It is generally preferred to isolate RNA from 
bronchial epithelium, although DNA can also be prepared 
using RNA from other tissues or isolated as genomic DNA. 
Total RNA can be prepared using guanidine HC1 extraction 
followed by isolation by centrifugation in a CsCl gradient 
(Chirgwin et al . , Biochemistry 18 :52-94. 1979). Poly (A) + 
RNA is prepared from total RNA using the method of Aviv 
and Leder ( Proc. Natl. Acad. Sci. USA 69 : 14 08-14 12 , 1972) . 
Complementary DNA (cDNA) is prepared from poly (A) + RNA 
using known methods. Polynucleotides encoding zampl 
polypeptides are then identified and isolated by, for 
example, hybridization or PCR. 

The present invention further provides 
counterpart polypeptides and polynucleotides from other 
species (orthologs) . These species include, but are not 
limited to mammalian, avian, amphibian, reptile, fish, 
insect and other vertebrate and invertebrate species. Of 
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particular interest are zampl polypeptides from other 
mammalian species, including murine, rat, porcine, ovine, 
bovine, canine, feline, equine and other primate proteins. 
Species homologs of the human proteins can be cloned using 
information and compositions provided by the present 
invention in combination with conventional cloning 
techniques. For example, a cDNA can be cloned using mRNA 
obtained from a tissue or cell type that expresses the 
protein. Suitable sources of mRNA can be identified by 
probing Northern blots with probes designed from the 
sequences disclosed herein. A library is then prepared 
from mRNA of a positive tissue of cell line. A zampl 
polypeptide-encoding cDNA can then be isolated by a 
variety of methods, such as by probing with a complete or 
partial human cDNA or with one or more sets of degenerate 
probes based on the disclosed sequences. A cDNA can also 
be cloned using the polymerase . chain reaction, or PCR 
(Mullis, U.S. Patent 4,683,202) , using primers designed 
from the sequences disclosed herein. Within an additional 
method, the cDNA library can be used to transform or 
transfect host cells, and expression of the cDNA of 
interest can be detected with an antibody to zampl 
polypeptide. Similar techniques can also be applied to 
the isolation of genomic clones. 

Those skilled in the art will recognize that the 
sequences disclosed in SEQ ID N0:1, SEQ ID NO:2, SEQ ID 
NO: 9 and SEQ ID NO: 10 represent a single allele of the 
human zampl gene and polypeptide, and that allelic 
variation and alternative splicing are expected to occur. 
Allelic variants can be cloned by probing cDNA or genomic 
libraries from different individuals according to standard 
procedures. Allelic variants of the DNA sequence shown in 
SEQ ID NO: 2 and SEQ ID NO: 10, including those containing 
silent mutations and those in which mutations result in 
amino acid sequence changes, are within the scope of the 
present invention. 



Within preferred embodiments of the invention, 
the isolated nucleic acid molecules can hybridize under 
stringent conditions to nucleic acid molecules having at 
least a portion of the nucleotide sequence of SEQ ID 
NOs:l, 4, 9 or 11 or to nucleic acid molecules having a 
nucleotide sequence complementary to those sequences. In 
general, stringent conditions are selected to be about 5°C 
lower than the thermal melting point (T m ) for the specific 
sequence at a defined ionic strength and pH. The T m is 
the temperature (under defined ionic strength and pH) at 
which 50% of the target sequence hybridizes to a perfectly 
matched probe . 

A pair of nucleic acid molecules, such as DNA- 
DNA, RNA-RNA and DNA-RNA, can hybridize if the nucleotide 
sequences have some degree of complementarity. Hybrids 
can tolerate mismatched base pairs in the double helix, 
but the stability of the hybrid is influenced by the 
degree of mismatch. The T m of the mismatched hybrid 
decreases by 1°C for every 1-1.5% base pair mismatch. 
Varying the stringency of the hybridization conditions 
allows control over the degree of mismatch that will be 
present in the hybrid. The degree of stringency increases 
as the hybridization temperature increases and the ionic 
strength of the hybridization buffer decreases. Stringent 
hybridization conditions encompass temperatures of about 
5-25°C below the T m of the hybrid and a hybridization 
buffer having up to 1 M Na*. Higher degrees of stringency 
at lower temperatures can be achieved with the addition of 
formamide which reduces the T n of the hybrid about 1°C for 
each 1% formamide in the buffer solution. Generally, such 
stringent conditions include temperatures of 20-70°C and a 
hybridization buffer containing up to 6xSSC and 0-50% 
formamide. A higher degree of stringency can be achieved 
at temperatures of from 40-70°C with a hybridization 
buffer having up to 4xSSC and from 0-50% formamide. 
Highly stringent conditions typically encompass 
temperatures of 42-70°C with a hybridization buffer having 



34 



up to IxSSC and 0-50% formamide. Different degrees of 
stringency can be used during hybridization and washing to 
achieve maximum specific binding to the target sequence. 
Typically, the washes following hybridization are 
5 performed at increasing degrees of stringency to remove 
non-hybridized polynucleotide probes from hybridized 
complexes . 

The above conditions are meant to serve as a 
U guide and it is well within the abilities of one skilled 

O 10 in the art to adapt these conditions for use with a 
particular polypeptide hybrid. The T m for a specific 
target sequence is the temperature (under defined 
conditions) at which 50% of the target sequence will 
hybridize to a perfectly matched probe sequence. Those 
15 conditions which influence- the T, include, the size and 
base pair content of the polynucleotide probe, the ionic 
strength of the hybridization solution, and the presence 
of destabilizing agents in the hybridization solution. 
Numerous equations for calculating T„ are known in the art, 
20 and are specific for DNA, RNA and DNA-RNA hybrids and 
polynucleotide probe sequences of varying length (see, for 
example, Sambrook et al.. Molecular Cloning: A Laboratory 
Manual, Second Edition (Cold Spring Harbor Press 1989) ; 
Ausubel et al . , (eds.), Current Protocols in Molecular 
25 Biology fjohn Wiley and Sons, Inc. 1987); Berger and 
Kimmel (eds.). Guide to Molecular Cloning Techniques , 
(Academic Press, Inc. 1987); and Wetmur, Crit. Rev. 
Biochem. Mol . Biol . 26.:227 (1990)). Sequence analysis 
software, such as OLIGO 6.0 (LSR; Long Lake, MN) and 
30 Primer Premier 4.0 (Premier Biosoft International; Palo 
Alto, CA) , as well as sites on the Internet, are available 
tools for analyzing a given sequence and calculating T m 
based on user defined criteria. Such programs can also 
analyze a given sequence under defined conditions and 
35 identify suitable probe sequences. Typically, 
hybridization of longer polynucleotide sequences, >50 base 
pairs, is performed at temperatures of about 20-25°C below 



the calculated T„. For smaller probes, <50 base pairs, 
hybridization is typically carried out at the or 5-10°C 
below. This allows for the maximum rate of hybridization 
for DNA-DNA and DNA-RNA hybrids. 

The length of the polynucleotide sequence 
influences the rate and stability of hybrid formation. 
Smaller probe sequences, <50 base pairs, reach equilibrium 
with complementary sequences rapidly, but may form less 
stable hybrids. Incubation times of anywhere from minutes 
to hours can be used to achieve hybrid formation. Longer 
probe sequences come to equilibrium more slowly, but form 
more stable complexes even at lower temperatures . 
Incubations are allowed to proceed overnight or longer. 
Generally, incubations are carried out for a period equal 
to three times the calculated Cot time. Cot time, the 
time it takes for the polynucleotide sequences to 
reassociate, can be calculated for a particular sequence 
by methods known in the art. 

The base pair composition of polynucleotide 
sequence will effect the thermal stability of the hybrid 
complex, thereby influencing the choice of hybridization 
temperature and the ionic strength of the hybridization 
buffer. A-T pairs are less stable than G-C pairs in 
aqueous solutions containing sodium chloride. Therefore, 
the higher the G-C content, the more stable the hybrid. 
Even distribution of G and C residues within the sequence 
also contribute positively to hybrid stability. In 
addition, the base pair composition can be manipulated to 
alter the T. of a given sequence. For example, 5- 
methyldeoxycytidine can be substituted for deoxycytidine 
and 5-bromodeoxuridine can be substituted for thymidine to 
increase the T m whereas 7-deazz-2 ' -deoxyguanosine can be 
substituted for guanosine to reduce dependence on T. . 

The ionic concentration of the hybridization 
buffer also affects the stability of the hybrid. 
Hybridization buffers generally contain blocking agents 
such as Denhardt's solution (Sigma Chemical Co., St. 
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Louis, Mo.), denatured salmon sperm DNA, tRNA, milk 
powders (BLOTTO) , heparin or SDS, and a Na" source, such as 
SSC (lx SSC: 0.15 M sodium chloride, 15 mM sodium citrate) 
•or SSPE (lx SSPE: 1.8 M NaCl , 10 mM NaH 2 PO,, 1 mM EDTA, pH 
5 7.7). By decreasing the ionic concentration of the 
buffer, the stability of the hybrid is increased. 
Typically, hybridization buffers contain from between 10 
mM - 1 M Na*. The addition of destabilizing or denaturing 
agents such as formamide, tetralkylammonium salts, 

10 guanidinium cations or thiocyanate cations to the 
hybridization solution will alter the T„ of a hybrid. 
Typically, formamide is used at a concentration of up to 
50% to allow incubations to be carried out at more 
convenient and lower temperatures. Formamide also acts to 

15 reduce non-specific background when using RNA probes. 

As an illustration, a nucleic acid molecule 
encoding a variant zampl polypeptide can be hybridized 
with a nucleic acid molecule having at least a portion of 
the nucleotide sequence of SEQ ID N0s:l, 4, 9 or 11 (or 

20 their complements) at 42°C overnight in a solution 
comprising 50% formamide, 5x SSC (lx SSC: 0.15 M sodium 
chloride and 15 mM sodium citrate) , 50 mM sodium phosphate 
(pH 7.S), 5x Denhardt's solution (lOOx Denhardt's 
solution: 2% (w/v) Ficoll 400, 2% (w/v) 

25 polyvinylpyrrolidone, and 2% (w/v) bovine serum albumin), 
10% dextran sulfate, and 20 |lg/ml denatured, sheared 
salmon sperm DNA. One of skill in the art can devise 
variations of these hybridization conditions. For 
example, the hybridization mixture can be incubated at a 

30 higher or lower temperature, such as about 65°C, in a 
solution that does not contain formamide. Moreover, 
premixed hybridization solutions are available (e.g., 
EXPRESSHYB Hybridization Solution from CLONTECH 
Laboratories, Inc.), and hybridization can be performed 

35 according to the manufacturer's instructions. 

Following hybridization, the nucleic acid 
molecules can be washed to remove non-hybridized nucleic 



acid molecules under stringent conditions, or under highly- 
stringent conditions. Typical stringent washing 
conditions include washing in a solution of 0.5x-2x SSC 
with 0.1% sodium dodecyl sulfate (SDS) at 55-65°C. That 
5 is, nucleic acid molecules encoding a variant zampl 
polypeptide hybridize with a nucleic acid molecule having 
at least a portion of the nucleotide sequence of SEQ ID 
NOs:l, 4, 9 or 11 (or their complements) under stringent 
washing conditions, in which the wash stringency is 

10 equivalent to 0.5x-2x SSC with 0.1% SDS at 50-65°C, 
including 0 . 5x SSC with 0.1% SDS at 55°C, or 2x SSC with 
0.1% SDS at 65°C. One of skill in the art can readily 
devise equivalent conditions, for example, by substituting 
SSPE for SSC in the wash solution. 

15 Typical highly -stringent washing conditions 

include washing in a solution of 0.1x-0.2x SSC with 0.1% 
sodium dodecyl sulfate (SDS) at 50-65°C. In other words, 
nucleic acid molecules encoding a variant zampl 
polypeptide hybridize with a nucleic acid molecule having 

20 at least a portion of the nucleotide sequence of SEQ ID 
N0s:l, 4, 9 or 11 (or their complements) under highly 
stringent washing conditions, in which the wash stringency 
is equivalent to 0.1x-0.2x SSC with 0.1% SDS at 50-65°C, 
including 0 . lx SSC with 0.1% SDS at 50°C, or 0.2x SSC with 

25 0.1% SDS at 65°C. 

The present invention also contemplates zampl 
variant nucleic acid molecules that can be identified 
using two criteria: a determination of the similarity 
between the encoded polypeptide with the amino acid 

3 0 sequence of SEQ ID N0s:2 or 10, and a hybridization assay, 
as described above. Such zampl variants include nucleic 
acid molecules (1) that hybridize with at least a portion 
of a nucleic acid molecule having the nucleotide sequence 
of SEQ ID NO : 1 , 4, 9 or 11 (or their complements) under 

3 5 stringent washing conditions, in which the wash stringency 
is equivalent to 0.5X-2X SSC with 0.1% SDS at 50-65°C, and 
(2) that encode a polypeptide having at least 80%, at 
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least 90%, at least 95% or greater than 95% sequence 
identity to the amino acid sequence of SEQ ID NOs : 2 or 10. 
Alternatively, zarapl variants can be characterized as 
nucleic acid molecules (1) that hybridize with a nucleic 
5 acid molecule having at least a portion of the nucleotide 
sequence of SEQ ID NO : 1 , 4, 9 or 11 {or their complements) 
under highly stringent washing conditions, in which the 
wash stringency is equivalent to 0.1X-0.2X SSC with 0.1% 
SDS at 50-65°C, and (2) that encode a polypeptide having 
O 10 at least 80%, at least 90%, at least 95% or greater than 
,J 95% sequence identity to the amino acid sequence of SEQ ID 

M NOs: 2 or 10. 

C~ The present invention also provides isolated 

nV zampl polypeptides that are substantially homologous to 

^ 15 the polypeptides of SEQ ID NO: 2 and SEQ ID NO: 10 and their 
y species homologs/orthologs . The term "substantially 

homologous" is used herein to denote polypeptides having 
50%, preferably 60%, more preferably at least 80%, 
sequence identity to the sequences shown in SEQ ID NO: 2 or 
20 SEQ ID NO: 10 or their orthologs or paralogs . Such 
polypeptides will more preferably be at least 90% 
identical, and most preferably 95% or more identical to 
SEQ ID NO: 2 or SEQ ID NO: 10 or its orthologs or paralogs. 
Percent sequence identity is determined by conventional 
25 methods. See, for example, Altschul et al . , Bull . Math. 
Bio . 48.: 603-616, 1986 and Henikoff and Henikoff, Proc ■ 
Natl. Acad. Sci. USA 89.: 10915 - 109 19 , 1992. Briefly, two 
amino acid sequences are aligned to optimize the alignment 
scores using a gap opening penalty of 10, a gap extension 
3 0 penalty of 1, and the "blosum 62" scoring matrix of 
Henikoff and Henikoff (ibid. ) as shown in Table 3 (amino 
acids are indicated by the standard one-letter codes) . 
The percent identity is then calculated as: 
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Total number of identical matches 

__ x 100 

[length of the longer sequence plus the 
5 number of gaps introduced into the longer 

sequence in order to align the two 
sequences] 
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Sequence identity of polynucleotide molecules is 
determined by similar methods using a ratio as disclosed 
above . 

Those skilled in the art appreciate that there 
5 are many established algorithms available to align two 
amino acid sequences. The "FASTA" similarity search 
algorithm of Pearson and Liptnan is a suitable protein 
alignment method for examining the level of identity 
shared by an amino acid sequence disclosed- herein and the 
S| 10 amino acid sequence of a putative variant zampl. The 
CI FASTA algorithm is described by Pearson and Lipman, Proc . 

Nat. Acad. Sci. USA 85:2444, 19S8, and by Pearson, Meth. 
¥ Enzvmol . 183 :63, 1990. 

JJJ Briefly, FASTA first characterizes sequence 

s 15 similarity by identifying . regions shared by the query 
sequence (e.g., SEQ ID NOs:2 or 10) and a test sequence 
that have either the highest density of identities (if the 
ktup variable is 1) or pairs of identities (if ktup=2), 
without considering conservative amino acid substitutions, 
20 insertions, or deletions. The ten regions with the 
highest density of identities are then re-scored by 
comparing the similarity of all paired amino acids using 
an amino acid substitution matrix, and the ends of the 
regions are "trimmed" to include only those residues that 
25 contribute to the highest score. If there are several 
regions with scores greater than the "cutoff" value 
(calculated by a predetermined formula based upon the 
length of the sequence and the ktup value) , then the 
trimmed initial regions are examined to determine whether 
3 0 the regions can be joined to form an approximate alignment 
with gaps. Finally, the highest scoring regions of the 
two amino acid sequences are aligned using a modification 
of the Needleman-Wunsch-S'ellers algorithm (Needleman and 
Wunsch, J. Mol. Biol . 48:444, 1970; Sellers, SIAM J. Appl ■ 
35 Math . 2_S:787, 1974), which allows for amino acid 
insertions and deletions. Preferred parameters for FASTA 
analysis are: ktup=l, gap opening penalty=10, gap 



extension penalty=l, and substitution matrix=BLOSUM62 . 
These parameters can be introduced into a FASTA program by 
modifying the scoring matrix file ("SMATRIX"), as 
explained in Appendix 2 of Pearson, Meth. Enzvmol . 183.: 63, 
1990. 

FASTA can also be used to determine the sequence 
identity of nucleic acid molecules using a ratio as 
disclosed above. For nucleotide sequence comparisons, the 
ktup value can range between one to six, preferably from 
three to six, most preferably three, with other parameters 
set as default . 

The present invention includes nucleic acid 
molecules that encode a polypeptide having one or more 
"conservative amino acid substitutions," compared with the 
amino acid sequence of SEQ ID NOs : 2 or 10. Conservative 
amino acid substitutions can be based upon the chemical 
properties of the amino acids. That is, variants can be 
obtained that contain one or more amino acid substitutions 
of SEQ ID NOs : 2 or 10, in which an alkyl amino acid is 
substituted for an alkyl amino acid in a zampl amino acid 
sequence, an aromatic amino acid is substituted for an 
aromatic amino acid in a zampl amino acid sequence, a 
sulfur-containing amino acid is substituted for a sulfur- 
containing amino acid in a zampl amino acid sequence, a 
hydroxy- containing amino acid is substituted for a 
hydroxy-containing amino acid in a zampl amino acid 
sequence, an acidic amino acid is substituted for an 
acidic amino acid in a zampl amino acid sequence, a basic 
amino acid is substituted for a basic amino acid in a 
zampl amino acid sequence, or a dibasic monocarboxylic 
amino acid is substituted for a dibasic monocarboxylic 
amino acid in a zampl amino acid sequence. 

Among the common amino acids, for example, a 
"conservative amino acid substitution" is illustrated by a 
substitution among amino acids within each of the 
following groups: (1) glycine, alanine, valine, leucine, 
and isoleucine, (2) phenylalanine, tyrosine, and 
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tryptophan, (3) serine and threonine, (4) aspartate and 
glutamate, (5) glutamine and asparagine, and (6) lysine, 
arginine and histidine. Table 4 also provided common 
conservative amino acid substitutions. 
5 The BLOSUM62 table is an amino acid substitution 

matrix derived from about 2,000 local multiple alignments 
of protein sequence segments, representing highly 
conserved regions of more than 500 groups of related 
proteins (Henikoff and Henikoff, Proc ■ Natl. Acad. Sci. 
PJ 10 USA .89:10915, 1992). Accordingly, the BLOSUM62 

□ substitution frequencies can be used to define 

*? conservative amino acid substitutions that may be 

h= introduced into the amino acid sequences of the present 

% invention. Although it is possible to design amino acid 

E 15 substitutions based solely, upon chemical properties (as 
discussed above) , the language "conservative amino acid 
substitution" preferably refers to a substitution 
represented by a BLOSUM62 value of greater than -1. For 
example, an amino acid substitution is conservative if the 
20 substitution is characterized by a BL0SUM62 value of 0, 
1, 2, or 3. According to this system, preferred 
conservative amino acid substitutions are characterized by 
a BLOSUM62 value of at least 1 (e.g., 1, 2 or 3), while 
more preferred conservative amino acid substitutions are 
25 characterized by a BLOSUN62 value of at least 2 (e.g., 2 
or 3) . 

Conservative amino acid changes in a zampl gene 
can be introduced by substituting nucleotides for the 
nucleotides recited in SEQ ID NOs : 1 or 9. Such 
3 0 "conservative amino acid" variants can be obtained, for 
example, by oligonucleotide-directed mutagenesis, linker- 
scanning mutagenesis, mutagenesis using the polymerase 
chain reaction, and the like (see Ausubel (1995) at pages 
8-10 to 8-22; and McPherson (ed.), Directed Mutagenesis A 
35 Practical Approach (IRL Press 1991) ) . The ability of such 
variants to promote the anti -microbial or other properties 
of the wild-type protein can be determined using a 
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standard methods, such as the assays described herein. 
Alternatively, a variant zampl polypeptide can be 
identified by the ability to specifically bind anti-zampl 
antibodies . 

Substantially homologous proteins and 

polypeptides are characterized as having one or more amino 
acid substitutions, deletions or additions. These changes 
are preferably of a minor nature, that is conservative 
amino acid substitutions (see Table 4) and other 
substitutions that do not significantly affect the folding 
or activity of the protein or polypeptide; small 
deletions, typically of one to about 30 amino acids; and 
small amino- or carboxyl- terminal extensions, such as an 
amino- terminal methionine residue, a small linker peptide 
of up to about 20-25 residues, or a small extension that 
facilitates purification (an affinity tag) , such as a poly- 
histidine tract, protein A (Nilsson et al . , EMBO J . 
4_:1075, 1985; Nilsson et al . , Methods Enzvmol ■ 198 : 3 . 
1991) , glutathione S transferase (Smith and Johnson, Gene 
67.:31, 1988), maltose binding protein (Kellerman and 
Ferehci, Methods Enzvmol ■ 90.: 459-463, 1982; Guan et al . , 
Gene 67.:21-30, 1987) , thioredoxin, ubiquitin, cellulose 
binding protein, T7 polymerase, or other antigenic epitope 
or binding domain. See,, in general Ford et al . , Protein 
Expression and Purification 2: 95-107, 1991. DNAs 
encoding affinity tags are available from commercial 
suppliers (e.g., Pharmacia Biotech, Piscataway, NJ; New 
England Biolabs, Beverly, MA). Polypeptides comprising 
affinity tags can further comprise a proteolytic cleavage 
site between the zampl polypeptide and the affinity tag. 
Preferred such sites include thrombin cleavage sites and 
factor Xa cleavage sites. 
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Table 4 

Conservative amino acid substitutions 



Basic : 

Acidic : 
Polar: 

Hydrophobic : 
Aromatic : 
Small : 
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argimne 

lysine 

histidine 

glutamic acid 

aspartic acid 

glutamine 

asparagine 

leucine 

isoleucine 

valine 

phenylalanine 

tryptophan 

tyrosine 

glycine 

alanine 

serine 

threonine 

methionine 



The proteins of the present invention can also 
comprise, in addition to the 20 standard amino acids, non- 
naturally occurring amino acid residues. Non-naturally 
25 occurring amino acids include, without limitation, trans- 

3- methylproline, 2 , 4-methanoproline, cis-4 -hydroxyproline , 
trans -4 -hydroxyproline, N-methyl -glycine, allo-threonine, 
methyl threonine , hydroxyethyl -cysteine , 
hydroxyethylhomocysteine, nitroglutamine, homoglutamine , 

30 pipecolic acid, tert - leucine , norvaline, 2- 

azaphenylalanine, 3-azaphenylalanine, 4 -azaphenyl- alanine, 

4- f luorophenylalanine, 4 -hydroxyproline , 6-W-methyl 
lysine, 2-aminoisobutyric acid, isovaline and a-methyl 
serine . Several methods are known in the art for 

35 incorporating non-naturally occurring amino acid residues 
into proteins. For example, an in vitro system can be 
employed wherein nonsense mutations are suppressed using 
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chemically aminoacylated suppressor tRNAs . Methods for 
synthesizing amino acids and aminoacylating tRNA are known 
in the art. Transcription and translation of plasmids 
containing nonsense mutations are carried out in a cell 
5 free system comprising an E. coli S30 extract and 
commercially available enzymes and other reagents. 
Proteins are purified by chromatography. See, for 

example, Robertson et al . , J . Am . Chem . Soc . 113 : 2722 . 
■ 1991; Ellman et al . , Meth. Enzvmol . 202:301, 1991; Chung 
C; 10 et al., Science 259 : 806-09 , 1993; and Chung et al . , Proc ■ 
5 Natl. Acad. Sci . USA 90:10145-49, 1993). In a second 

/" method, translation is carried out in Xenopus oocytes by 

Ui microinjection of mutated mRNA and chemically 

aminoacylated suppressor tRNAs (Turcatti et al . , J. Biol. 
J" 15 Chem. 271 :19991-98, 1996). Within a third method, E. coli 
cells are cultured in the absence of a natural amino acid 
n that is to be replaced (e.g., phenylalanine) and in the 

ImJ 

Ui presence of the desired non-naturally occurring amino 

ir; acid(s) (e.g., 2 -azaphenylalanine , 3 -azaphenylalanine, 4- 

20 azaphenylalanine, or 4-f luorophenylalanine) . The non- 
naturally occurring amino acid is incorporated into the 
protein in place of its natural counterpart. See, Koide 
et al . , Biochem ■ 3.3:7470-76, 1994. Naturally occurring 
amino acid residues can be converted to non-naturally 

25 occurring species by in vitro chemical modification. 
Chemical modification can be combined with site-directed 
mutagenesis to further expand the range of substitutions 
(Wynn and Richards, Protein Sci. 2:395-403, 1993) . 

A limited number of non-conservative amino 

30 acids, amino acids that are not encoded by the genetic 
code, non-naturally occurring amino acids, and unnatural 
amino acids may be substituted for zampl polypeptide amino 
acid residues. "Unnatural amino acids" have been modified 
after protein synthesis, and/or have a chemical structure 

35 in their side chain (s) different from that of the standard 
amino acids. Unnatural amino acids can be chemically 
synthesized, or preferably, are commercially available, 
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and include pipecolic acid, thiazolidine carboxylic acid, 
dehydroproline , 3- and 4-methylproline, and 3,3- 
dimethylproline . 

Essential amino acids in the zampl polypeptides 
5 of the present invention can be identified according to 
procedures known in the art, such as site-directed 
mutagenesis, or alanine -scanning mutagenesis (Cunningham 
and Wells, Science 244 : 1081-1085, 1989) . In the latter 
technique, single alanine mutations are introduced at 

■Z" 10 every residue in the molecule, and the resultant mutant 

O 

□ molecules are tested for biological activity (e.g. , anti- 

"6 microbial activity) to identify amino acid residues that 

■Ui are critical to the activity of the molecule. See also, 

U Hilton et al . , J. Biol. Chem. 271 :4699-4708, 1996. Sites 

~~ 15 of ligand-receptor or other biological interaction can 
-p also be determined by physical analysis of structure, as 

S: determined by such techniques as nuclear magnetic 

[H resonance, crystallography, electron diffraction or 

photoaf f inity labeling, in conjunction with mutation of 

HI 

20 putative contact site amino acids. See, for example, de 
Vos et al., Science 255:306-312, 1992; Smith et al . , J. 
Mol. Biol. 224 :899-904, 1992; Wlodaver et al . , FEBS Lett . 
309 :59-64 . 1992. The identities of essential amino acids 
can also be inferred from analysis of homologies with 

25 related p-defensins. 

Multiple amino acid substitutions can be made 
and tested using known methods of mutagenesis and 
screening, such as those disclosed by Reidhaar-Olson and 
Sauer ( Science 241 :53-57, 1988) or Bowie and Sauer ( Proc . 

30 Natl. Acad. Sci . USA 86:2152-2156, 1989). Briefly, these 
authors disclose methods for simultaneously randomizing 
two or more positions in a polypeptide, selecting for 
functional polypeptide, and then sequencing the 
mutagenized polypeptides to determine the spectrum of 

35 allowable substitutions at each position. Other methods 
that can be "used include phage display (e.g., Lowman et 
al., Biochem. 30:10832-10837, 1991; Ladner et al . , U.S. 
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Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) 
and region-directed mutagenesis (Derbyshire et al., Gene 
46=145, 1986; Ner et al . , DNA 7:127, 1988). 

Variants of the disclosed zampl DNA and 
5 polypeptide sequences can be generated through DNA 
shuffling as disclosed by Stemmer, Nature 370 :389-91. 
1994, Stemmer, Proc. Natl. Acad. Sci. USA 91:10747-51, 
1994 and WIPO Publication WO 97/20078. Briefly, variant 
DNAs are generated by in vitro homologous recombination by 

10 random fragmentation of a parent DNA followed by 
reassembly using PCR, resulting in randomly introduced 
point mutations. This technique can be modified by using 
a family of parent DNAs, such as allelic variants or DNAs 
from different species, to introduce additional 

15 variability into the process. Selection or screening for 
the desired activity, followed by additional iterations of 
mutagenesis and assay provides for rapid "evolution" of 
sequences by selecting for desirable mutations while 
simultaneously selecting against detrimental changes. 

20 Mutagenesis methods as disclosed above can be 

combined with high- throughput , automated screening methods 
to detect activity of cloned, mutagenized polypeptides in 
host cells. Mutagenized DNA molecules that encode active 
polypeptides (e.g., anti-microbial activity) can be 

25 recovered from' the host cells and rapidly sequenced using 
modern equipment. These methods allow the rapid 

determination of the importance of individual amino acid 
residues in a polypeptide of interest, and can be applied 
to polypeptides of unknown structure. 

30 Using the methods discussed above, one of 

ordinary skill in the art can identify and/or prepare a 
variety of polypeptides that are substantially homologous 
to residues 1 to 65 of SEQ ID NO: 2 or to residues 1 to 67 
of SEQ ID NO: 10, residues 30 or 31 to residues 63 or 64 

35 of SEQ ID NO: 2 or polypeptides of SEQ ID NOs:14, 15 or 16, 
or allelic variants thereof and retain the anti-microbial 
properties of the wild- type protein. Such polypeptides 
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may include additional amino acids from affinity tags and 
the like. Such polypeptides may also include additional 
polypeptide segments as generally disclosed above. 

The polypeptides of the present invention, 
5 including full-length proteins, fragments thereof and 
fusion proteins, can be produced in genetically engineered 
host cells according to conventional techniques. However, 
host cells must be selected with some care as a result of 
the anti-microbial activity of the molecules of the 

10 present invention. For example, any cell culture-based 
system must be evaluated, because zampl polypeptides, 
fragments, fusion proteins, antibodies, agonists, or 
antagonists may kill the host cell as a part of an anti- 
microbial function. Zampl polypeptides are of a small 

15 enough size to permit preparation by PCR or other protein 
chemistry techniques to avoid any potential host cell 
toxicity problems. Alternatively, native or engineered 
precursor proteins, prior to post-translational cleavage 
to yield the mature zampl polypeptide, are inactive, 

20 thereby limiting host cell cytotoxicity prior to lysosomal 
packaging. See, for example, Lehrer et al . , Cell 64.: 229- 
30, 1991. Thus, precursor proteins to zampl polypeptides 
may be produced in microbial cell culture. 

Suitable host cells are those cell types that 

25 can be transformed or transf ected with exogenous DNA and 
grown in culture, and include bacteria, fungal cells, and 
cultured higher eukaryotic cells. Eukaryotic cells, 
particularly cultured cells of multicellular organisms, 
are preferred. Techniques for manipulating cloned DNA 

30 molecules and introducing exogenous DNA into a variety of 
host cells are disclosed by Sambrook et al . , Molecular 
Cloning : A Laboratory Manual . 2nd ed. , Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989, and 
Ausubel et al . (eds.), Current Protocols in Molecular 

35 Biology , John Wiley and Sons, Inc., NY, 1987. 

In general, a DNA sequence encoding a zampl 
polypeptide of the present invention is operably linked to 
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other genetic elements required for its expression, 
generally including a transcription promoter and 
terminator within an expression vector. The vector will 
also commonly contain one or more selectable markers and 
one or more origins of replication, although those skilled 
in the art will recognize that within certain systems 
selectable markers may be provided on separate vectors, 
and replication of the exogenous DNA may be provided by- 
integration into the host cell genome. Selection of 
promoters, terminators, selectable markers, vectors and 
other elements is a matter of routine design within the 
level of ordinary skill in the art. Many such elements 
are described in the literature and are available through 
commercial suppliers. 

To direct a zampl polypeptide into the secretory 
pathway of a host cell, a secretory signal sequence (also 
known as a leader sequence, prepro sequence or pre 
sequence) is provided in the expression vector. The 
secretory signal sequence may be that of the zampl 
polypeptide, or may be derived from another secreted 
protein (e.g., t-PA) or synthesized de novo. The 
secretory signal sequence is joined to the zampl 
polypeptide-encoding DNA sequence in the correct reading 
frame. Secretory signal sequences are commonly positioned 
5' to the DNA sequence encoding the polypeptide of 
interest, although certain signal sequences may be 
positioned elsewhere in the DNA sequence of interest (see, 
e.g., Welch et al . , U.S. Patent No. 5,03 7,74 3; Holland et 
al., U.S. Patent No. 5,143,830). 

Alternatively, the secretory signal sequence 
contained in the polypeptides of the present invention is 
used to direct other polypeptides into the secretory 
pathway. The present invention provides for such fusion 
polypeptides. A signal fusion polypeptide can be made 
wherein a secretory signal sequence derived from amino 
acid residues 1-18 or 12 of SEQ ID NO: 2 or amino acid 
residues 1-20 or 22 of SEQ ID N0:10, is operably linked to 
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another polypeptide using methods known in the art and 
disclosed herein. The secretory signal sequence contained 
in the fusion polypeptides of the present invention is 
preferably fused amino-terminally to an additional peptide 
5 to direct the additional peptide into the secretory 
pathway. Such constructs have numerous applications known 
in the art. For example, these novel secretory signal 
sequence fusion constructs can direct the secretion of an 
M active component of a normally non-secreted protein, such 

~ 10 as a receptor. Such fusions may be used in vivo or in 

vitro to direct peptides through the secretory pathway. 
f'° Cultured mammalian cells are also preferred 

5tj hosts within the . present invention. . Methods for 

« ! introducing exogenous DNA into mammalian host cells 

J== 15 include calcium phosphate-mediated transfection (Wigler et 
jjj al., Cell 14:725, 1978; Corsaro and Pearson, Somatic Cell 

h( Genetics 2=603, 1981: Graham and Van der Eb, Virology 

p 5J2:456, 1973), electroporation (Neumann et al . , EMBO J . 

Hi JL:841-845, 1982), DEAE-dextran mediated transfection 

20 (Ausubel et al . , eds., Current Protocols in Molecular 
Biology , John Wiley and Sons, Inc., NY, 1987), liposome- 
mediated transfection (Hawley-Nelson et al . , Focus 15:73, 
1993; Ciccarone et al . , Focus 15:80, 1993), and viral 
vectors (A. Miller and G. Rosman, BioTechniaues 7:980-90, 
25 1989; Q. Wang and M. Finer, Nature Med. 2.:714-16, 1996) . 
The production of recombinant polypeptides in cultured 
mammalian cells is disclosed, for example, by Levinson et 
al., U.S. Patent No. 4,713,339; Hagen et al., U.S. Patent 
No. 4,784,950; Palmiter et al . , U.S. Patent No. 4,579,821; 
3 0 and Ringold, U.S. Patent No. 4,656,134. Preferred 
cultured mammalian cells include the COS-1 (ATCC No. CRL 
1650), COS-7 (ATCC No. CRL 1651), BHK 570 (ATCC No. CRL 
10314), 293 (ATCC No. CRL 1573; Graham et al . , J. Gen . 
Virol . 36. : 59-72, 1977) and Chinese hamster ovary (e.g. 
35 CHO-K1; ATCC No. CCL 61) cell lines.. Additional suitable 
cell lines are known in the art and available from public 
depositories such as the American Type Culture Collection, 
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Rockville, Maryland. In general, strong transcription 
promoters are preferred, such as promoters from SV-40 or 
cytomegalovirus. See, e.g., U.S. Patent No. 4,955,288. 
Other suitable promoters include those from 
5 metallothionein genes (U.S. Patent Nos . 4,579,821 and 
4,601,978) and the adenovirus major late promoter. 

Drug selection is generally used to select for 
cultured mammalian cells into which foreign DNA has been 
inserted. Such cells are commonly referred to as 

10 " transf ectants" . Cells that have been cultured in the 
presence of the selective agent and are able to pass the 
gene of interest to their progeny are referred to as 
"stable transf ectants . " A preferred selectable marker is 
a gene encoding resistance to the antibiotic neomycin. 

15 Selection is carried out in the presence of a neomycin- 
type drug, such as G-418 or the like. Selection systems 
may also be used to increase the expression level of the 
gene of interest, a process referred to as 
"amplification." Amplification is carried out by 

2 0 culturing transf ectants in the presence of a low level of 

the selective agent and then increasing the amount of 
selective agent to select for cells that produce high 
levels of the products of the introduced genes. A 
preferred amplifiable selectable marker is dihydrof olate 

25 reductase, which confers resistance to methotrexate. 
Other drug resistance genes (e.g., hygromycin resistance, 
multi-drug resistance, puromycin acetyltransf erase) can 
also be used. Alternative markers that introduce an 
altered phenotype, such as green fluorescent protein, or 

30 cell surface proteins such as CD4 , CD8, Class I MHC, 
placental alkaline phosphatase may be used to sort 
transfected cells from untransf ected cells by such means 
as FACS sorting or magnetic bead separation technology. 

Other higher eukaryotic cells can also be used 

3 5 as hosts, including plant cells, insect cells and avian 

cells. The use of Agrobacterium rhizogenes as a vector 
for expressing genes in plant cells has been reviewed by 
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Sinkar et al . , J. Biosci . CBangalorej 11:47-58, 1987. 
Transformation of insect cells and production of foreign 
polypeptides therein is disclosed by Guarino et al., U.S. 
Patent No. 5,162,222 and WIPO publication WO 94/06463. 
Insect cells can be infected with recombinant baculovirus, 
commonly derived from Autographa calif ornica nuclear 
polyhedrosis virus (AcNPV) . DNA encoding the zampl 

polypeptide is inserted into the baculoviral genome in 
place of the AcNPV polyhedrin gene coding sequence by one 
of two methods. . The first is the traditional method of 
homologous DNA recombination between wild-type AcNPV and a 
transfer vector containing the zampl flanked by AcNPV 
sequences. Suitable insect cells, e.g. SF9 cells, are 
infected with wild-type AcNPV and transfected with a 
transfer vector comprising a zampl polynucleotide operably 
linked to an AcNPV polyhedrin gene promoter, terminator, 
and flanking sequences. See, King and Possee, The 

Baculovirus Expression System: A Laboratory Guide , 

London, Chapman & Hall; O'Reilly et al . , Baculovirus 
Expression Vectors: A Laboratory Manual , New York, Oxford 
University Press., 1994; and, Richardson, C. D., Ed., 
Baculovirus Expression Protocols. Methods in Molecular 
Biology , Totowa, NJ, Humana Press, 1995. Natural 
recombination within an insect cell will result in a 
recombinant baculovirus which contains zampl driven by the 
polyhedrin promoter. Recombinant viral stocks are made by 
methods commonly used in the art . 

The second method of making recombinant 
baculovirus utilizes a transposon-based system described 
by Luckow (Luckow et al . , J Virol . .67:4566-79, 1993). 
This system is sold in the Bac-to-Bac kit (Life 
Technologies, Rockville, MD) . This system utilizes a 
transfer vector, pFastBacl™ (Life Technologies) containing 
a Tn7 transposon to move the DNA encoding the zampl 
polypeptide into a baculovirus genome maintained in E. 
coli as a large plasmid called a "bacmid." The pFastBacl™ 
transfer vector utilizes the AcNPV polyhedrin promoter to 
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drive the expression of the gene of interest, in this case 
zampl . However, pFastBacl™ can be modified to a 

considerable degree. The polyhedrin promoter can be 
removed and substituted with the baculovirus basic protein 
5 promoter (also known as Pcor, p6 . 9 or MP promoter) which 
is expressed earlier in the baculovirus infection, and has 
been shown to be advantageous for expressing secreted 
proteins. See, Hill-Perkin and Possee, J. Gen. Virol . 
71:971-6, 1990; Bonning. et al . , J. Gen. Virol . 75:1551-6, 

10 1994; and, Chazenbalk and Rapoport, J. Biol. Chem . 
270 -.1543-9 . 1995. In such transfer vector constructs, a 
short or long version of the basic protein promoter can be 
used. Moreover, transfer vectors can be constructed which 
replace the native zampl secretory signal sequences with 

15 secretory signal sequences derived from insect proteins. 
For example, a secretory signal sequence from Ecdysteroid 
Glucosyltransf erase (EGT) , honey bee Melittin (Invitrogen, 
Carlsbad, CA) , or baculovirus gp67 (PharMingen, San Diego, 
CA) can be used in constructs to replace the native zampl 

20 secretory signal sequence. In addition, transfer vectors 
can include an in- frame fusion with DNA encoding an 
epitope tag at the C- or N- terminus of the expressed zampl 
polypeptide, for example, a Glu-Glu epitope tag 
(Grussenmeyer et al., Proc. Natl. Acad. Sci .. 82 : 7952-4 , 

25 1985). Using a technique known in the art, a transfer 
vector containing zampl is transformed into E. coli, and 
screened for bacmids which contain an interrupted lacZ 
gene indicative of recombinant baculovirus. The bacmid 
DNA containing the recombinant baculovirus genome is 

3 0 isolated, using common techniques, and used to transfect 
Spodoptera frugiperda cells, e.g. Sf9 cells. Recombinant 
virus that expresses zampl is subsequently produced. 
Recombinant viral stocks are made by methods commonly used 
the art . 

35 The recombinant virus is used to infect host 

cells, typically a cell line derived from the fall 
armyworm, Spodoptera frugiperda. See, in general, Glick 
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and Pasternak, Molecular Biotechnology : Principles and 
Applications of Recombinant DNA , ASM Press, Washington, 
D.C., 1994. Another suitable cell line is the High FiveO™ 
cell line (Invitrogen) derived from Trichoplusia ni (U.S. 
Patent #5,300,435). Commercially available serum-free 
media are used to grow and maintain the cells. Suitable 
media are Sf900 II™ (Life Technologies) or ESF 921™ 
(Expression Systems) for the Sf9 cells; and Ex-cellO405™ 
(JRH Biosciences, Lenexa, KS) or Express FiveO™ (Life 
Technologies) .' for the T. ni cells. The cells are grown up 
from an inoculation density of approximately 2-5 x 10 s 
cells to a density of 1-2 x 10 s cells at which time a 
recombinant viral stock is added at a multiplicity of 
infection (MOD of 0.1 to 10, more typically near 3. The 
recombinant virus- infected cells typically produce the 
recombinant zampl polypeptide at 12-72 hours post- 
infection and secrete it with varying efficiency into the 
medium. The culture is usually harvested 48 hours post- 
infection. Centrifugation is used to separate the cells 
from the medium (supernatant) . The supernatant containing 
the zampl polypeptide is filtered through micropore 
filters, usually 0.45 urn pore size. Procedures used are 
generally described in available laboratory manuals (King 
and Possee,- ibid . ; O'Reilly et al . , ibid . .- Richardson, 
ibid . ) . Subsequent purification of the zampl polypeptide 
from the supernatant can be achieved using methods 
described herein. 

Fungal cells, including yeast cells, can also be 
used within the present invention. Yeast species of 
particular interest in this regard include Saccharomyces 
cerevisiae, Pichia pastoris, and Pichia methanolica . 
Methods for transforming S. cerevisiae cells with 
exogenous DNA and producing recombinant polypeptides 
therefrom are disclosed by, for example, Kawasaki, U.S. 
Patent No. 4,599,311; Kawasaki et al . , U.S. Patent No. 
4,931,373; Brake, U.S. Patent No. 4,870,008; Welch et al., 
U.S. Patent No. 5,037,743; and Murray et al . , U.S. Patent 
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No. 4,845,075. Transformed cells are selected by 

phenotype determined by the selectable marker, commonly 
drug resistance or the ability to grow in the absence of a 
particular nutrient (e.g., leucine). A preferred vector 
5 system for use in Saccharomyces cerevisiae is the POT1 
vector system disclosed by Kawasaki et al . (U.S. Patent 
No. 4,931,373), which allows transformed cells to be 
selected by growth in glucose-containing media. Suitable 
promoters and terminators for use in yeast include those 

10 from glycolytic enzyme genes (see, e.g., Kawasaki, U.S. 
Patent No. 4,599,311; Kingsman et al . , U.S. Patent No. 
4,615,974; and Bitter, U.S. Patent No. 4,977,092) and 
alcohol dehydrogenase genes. See also U.S. Patents Nos . 
4,990,446; . 5,063,154; 5,139,936 and 4,661,454. 

15 Transformation systems for other yeasts, including 
Hansenula polymorpha, Schizosaccharomyces pombe, 
Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago 
maydis, Pichia pastoris, Pichia methanolica, Pichia 
guillermondii and Candida maltosa are known in the art. 

20 See, for example, Gleeson et al . , J. Gen. Microbiol . 
132 :3459-65. 1986 and Cregg, U.S. Patent No. 4,882,279. 
Aspergillus cells may be utilized according to the methods 
of McKnight et al., U.S. Patent No. 4,935,349. Methods 
for transforming Acremonium chrysogenum are disclosed by 

25 Sumino et al., U.S. Patent No. 5,162,228. Methods for 
transforming Neurospora are disclosed by Lambowitz, U.S. 
Patent No . 4 , 486 , 533 . 

The use of Pichia methanolica as host for the 
production of recombinant proteins is disclosed in WIPO 

30 Publications WO 97/17450, WO 97/17451, WO 98/02536, and WO 
98/02565. DNA molecules for use in transforming P. 
methanolica will commonly be prepared as double- stranded, 
circular plasmids, which are preferably linearized prior 
to transformation. For polypeptide production in P. 

35 methanolica, it is preferred that the promoter and 
terminator in the plasmid be that of a P. methanolica 
gene, such as a P. methanolica alcohol utilization gene 



57 



(AUG1 or AUG2) . Other useful promoters include those of 
the dihydroxyacetone synthase (DHAS) , formate 

dehydrogenase (FMD) , and catalase (CAT) genes. To 
facilitate integration of the . DNA into the host 

5 chromosome, it is preferred to have the entire expression 
segment of the plasmid flanked at both ends by host DNA 
sequences. A preferred selectable marker for use in 
Pichia methanolica is a P. methanolica ADE2 gene, which 
encodes phosphoribosyl-5-aminoimidazole carboxylase (AIRC; 

t° 10 EC 4.1.1.21), which allows ade2 host cells to grow in the 

Us 

□ absence of adenine. For large-scale, industrial processes 

. .=* 

=* where it is desirable to minimize the use of methanol, it 

U is preferred to use host cells in which both methanol 

utilization genes {AUG1 and AUG2) are deleted. For 
15 production of secreted proteins, host cells deficient in 
O vacuolar protease genes {PEP4 and PRB1) are preferred. 

Electroporation is used to facilitate the introduction of 
Lfi a plasmid containing DNA encoding a polypeptide of 

y interest into P. methanolica cells. It is preferred to 

fli 

20 transform P. methanolica cells by electroporation using 
an exponentially decaying, pulsed electric field having a 
field strength of from 2.5 to 4.5 kv/cm, preferably about 
3.75 kV/cm, and a time constant (t) of from 1 to 40 
milliseconds, most preferably about 20 milliseconds. 

25 Prokaryotic host cells, including strains of the 

bacteria Escherichia coli, Bacillus and other genera are 
also useful host cells within the present invention. 
Techniques for transforming these hosts and expressing 
foreign DNA sequences cloned therein are well known in the 

30 art (see, e.g., Sambrook et al . , ibid.). When expressing 
a zampl polypeptide in bacteria such as E. coli, the 
polypeptide may be retained in the cytoplasm, typically as 
insoluble granules, or may be directed to the periplasmic 
space by a bacterial secretion sequence. In the former 

35 case, the cells are lysed, and the granules are recovered 
and denatured using, for example, guanidine isothiocyanate 
or urea. The denatured polypeptide can then be refolded 
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and dimerized by diluting the denaturant, such as by 
dialysis against a solution of urea and a combination of 
reduced and oxidized glutathione, followed by dialysis 
against a buffered saline solution. In the latter case, 
5 the polypeptide can be recovered from the periplasmic 
space in a soluble and functional form by disrupting the 
cells (by, for example, sonication or osmotic shock) to 
release the contents of the periplasmic space and 
recovering the protein, thereby obviating the need for 
pi 10 denaturation and refolding. 

\t) Transformed or transfected host cells are 

cultured according to conventional procedures in a culture 
01 medium containing nutrients and other components required 

for' the growth of the chosen host cells. A variety of 
CI 15 suitable media, including defined media and complex media, 
are known in the art and generally include a carbon 
yi source, a nitrogen source, essential amino acids, vitamins 

and minerals. Media may also contain such components as 
growth factors or serum, as required. The growth medium 
20 will generally select for cells containing the exogenously 
added DNA by, for example, drug selection or deficiency in 
an essential nutrient which is complemented by the 
selectable marker carried on the expression vector or co- 
transfected into the host cell. P. methanolica cells are 
25 cultured in a medium comprising adequate sources of 
carbon, nitrogen and trace nutrients at a temperature of 
about 25°C to 35°C. Liquid cultures are provided with 
sufficient aeration by conventional means, such as shaking 
of small flasks or sparging of f ermentors . A preferred 
30 culture medium for P. methanolica is YEPD (2% D-glucose, 
2% Bacto™ Peptone (Difco Laboratories, Detroit, MI), 1% 
Bacto™ yeast extract (Difco Laboratories), 0.004% adenine 
and 0.006% L-leucine) . 

Expressed recombinant zampl polypeptides (or 
35 chimeric zampl polypeptides) can be purified using 
fractionation and/or conventional purification methods and 
media. Ammonium sulfate precipitation and acid or 
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chaotrope extraction may be used for fractionation of 
samples. Exemplary purification steps may include 

hydroxyapatite, size exclusion, FPLC and reverse-phase 
high performance liquid chromatography. Suitable anion 
5 exchange media include derivatized dextrans, agarose, 
cellulose, polyacrylamide , specialty silicas, and the 
like. PEI, DEAE, QAE and Q derivatives are preferred, 
with DEAE Fast-Flow Sepharose (Pharmacia, Piscataway, NJ) 
being particularly preferred. Exemplary chromatographic 
10 media include those media derivatized with phenyl, butyl, 
0 or octyl groups, such as Phenyl -Sepharose FF (Pharmacia), 

r 1 Toyopearl butyl 650 (Toso Haas, Montgomeryville, PA), 

y. Octyl -Sepharose (Pharmacia) and the like; or polyacrylic 

resins, such as Amberchrom CG 71 (Toso Haas) and the like. 
*" 15 Suitable solid supports include glass beads, silica-based 
Cl resins, cellulosic resins, agarose beads, cross-linked 

h \ 

i.1 agarose beads, polystyrene beads, cross-linked 

u! polyacrylamide resins and the like that are insoluble 

t! under the conditions in which they are to be used. These 

20 supports may be modified with reactive groups that allow 
attachment of proteins by amino groups, carboxyl groups, 
sulfhydryl groups, hydroxyl groups and/or carbohydrate 
moieties. Examples of coupling chemistries include 

cyanogen bromide activation, N-hydrpxysuccinimide 
25 activation, epoxide activation, sulfhydryl activation, 
hydrazide activation, and carboxyl and amino derivatives 
for carbodiimide coupling chemistries. These and other 
solid media are well known and widely used in the art, and 
are available from commercial suppliers. Methods for 
3 0 binding receptor polypeptides to support media are well 
known in the art. Selection of a particular method is a 
matter of routine design and is determined in part by the 
properties of the chosen support. See, for example. 
Affinity Chromatography: Principles & Methods . Pharmacia 
35 LKB Biotechnology, Uppsala, Sweden, 1988. 

The polypeptides of the present invention can be 
isolated by exploitation of their structural properties. 



For example, immobilized metal ion adsorption (IMAC) 
chromatography can be used to purify histidine-rich 
proteins or proteins having a His-affinity tag. Briefly, 
a gel is first charged with divalent metal ions to form a 
chelate (Sulkowski,. Trends in Biochem. 3:1-1, 1985). 
Histidine-rich proteins will be adsorbed to this matrix 
with differing affinities, depending upon the metal ion 
used, and will be eluted by competitive elution, lowering 
the pH, or use of strong chelating agents. Other methods 
of purification include purification of glycosylated 
proteins by lectin affinity chromatography and ion 
exchange chromatography ( Methods in Enzymol . , Vol. 182, 
"Guide to Protein Purification", M. Deutscher, (ed.), 
Acad. Press, Sari Diego, 1990, pp. 529-39). Within 
additional embodiments of the invention, a fusion of the 
polypeptide of interest and an affinity tag (e.g., 
maltose -binding protein, an immunoglobulin domain) may be 
constructed to facilitate purification. 

Protein refolding (and optionally reoxidation) 
procedures may be advantageously used. It is preferred to 
purify the protein to >80% purity, more preferably to >90% 
purity, even more preferably >95%, and particularly 
preferred is a pharmaceutically pure state, that is 
greater than 99.9% pure with respect to contaminating 
macromolecules, particularly other proteins and nucleic 
acids, and free of infectious and pyrogenic agents. 
Preferably, a purified protein is substantially free of 
other proteins, particularly other proteins of animal 
origin. 

Zampl polypeptides or fragments thereof may also 
be prepared through chemical synthesis. Zampl 
polypeptides may be monomers or multimers,- glycosylated or 
non-glycosylated; pegylated or non-pegylated; amidated or 
non-amidated; sulfated or non-sulf ated; and may or may not 
include an initial methionine amino acid residue. For 
example, zampl polypeptides can be synthesized by 
exclusive solid phase synthesis, partial solid phase 
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methods, fragment condensation or classical solution 
synthesis. The polypeptides are preferably prepared by 
solid phase peptide synthesis, for example as described by 
Merrifield, J. Am. Chem. Soc . 85_:2149,. 1963. The 
5 synthesis is carried out with amino acids that are 
protected at the alpha-amino terminus. Trif unctional 
amino acids with labile side-chains are also protected 
with suitable groups to prevent undesired chemical 
y reactions from occurring during the assembly of the 

Q 10 polypeptides. The alpha-amino protecting group is 

"I selectively removed to allow subsequent reaction, to take 

w, place at the amino- terminus . The conditions for the 

f* removal of the alpha-amino protecting group do not remove 

y t ■ 

jji the side-chain protecting groups. 

15 The alpha-amino protecting groups are those 

known to be useful in the art of stepwise polypeptide 
O synthesis. Included are acyl type protecting groups 

£: (e.g., formyl, tr if luoroacetyl , acetyl), aryl type 

fjj protecting groups (e.g. , biotinyl) , aromatic urethane type 

20 protecting groups [e.g., benzyloxycarbonyl (Cbz) , 
substituted benzyloxycarbonyl and 9-f luorenylmethyloxy- 
carbonyl (Fmoc) ] , aliphatic urethane protecting groups 
[e.g., t-butyloxycarbonyl (tBoc) , isopropyloxycarbonyl , 
cyclohexloxycarbonyl] and alkyl type protecting groups 
25 (e.g., benzyl, triphenylmethyl) . The preferred protecting 
groups are tBoc and Fmoc . 

The side-chain protecting groups selected must 
remain intact during coupling and not be removed during 
the deprotection of the amino-terminus protecting group or 
30 during coupling conditions. The side-chain protecting 
groups must also be removable upon the completion of 
synthesis using reaction conditions that will not alter 
the finished polypeptide. In tBoc chemistry, the side- 
chain protecting groups for trif unctional amino acids are 
35 mostly benzyl based. In Fmoc chemistry, they are mostly . 
tert-butyl or trityl based. 
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In tBoc chemistry, the preferred side-chain 
protecting groups are tosyl for arginine, cyclohexyl for 
aspartic acid, 4-methylbenzyl (and acetamidomethyl) for 
cysteine, benzyl for glutamic acid, serine and threonine, 
5 benzyloxymethyl (and dinitrophenyl) for histidine, 2-C1- 
benzyloxycarbonyl for lysine, formyl for tryptophan and 2- 
bromobenzyl for tyrosine. In Fmoc chemistry, the 

preferred side-chain protecting groups are 2,2,5,7,8- 
pentamethylchroman- 6-sulf onyl (Pmc) or 2 , 2 , 4 , 6 , 7-penta- 
10 methyldihydrobenzofuran-5-sulfonyl (Pbf) for arginine, 
trityl for asparagine, cysteine, glutamine and histidine, 
tert-butyl for aspartic acid, glutamic acid, serine, 
H threonine and tyrosine, tBoc for lysine and tryptophan. 

~ For the synthesis of phosphopeptides , either 

s 15 direct or post-assembly incorporation of the phosphate 
!-{ group is used. In the direct incorporation strategy, the 

p phosphate group on serine, threonine or tyrosine may be 

protected by methyl, benzyl, or tert-butyl in Fmoc 

C? 

n\ chemistry or by methyl, benzyl or phenyl in tBoc 

20 chemistry. Direct incorporation of phosphotyrosine without 
phosphate protection can also be used in Fmoc chemistry. 
In the post -assembly incorporation strategy, the 
unprotected hydroxyl groups of serine, threonine or 
tyrosine are derivatized on solid phase with di-tert- 
25 butyl-, dibenzyl- or dimethyl-N, N ' -diisopropyl- 

phosphoramidite and then oxidized by tert-butylhydro- 
peroxide . 

Solid phase synthesis is usually carried out 
from the carboxyl- terminus by coupling the alpha-amino 

30 protected (side-chain protected) amino acid to a suitable 
solid support. An ester linkage is formed when the 
attachment is made to a chloromethyl , chlorotrityl or 
hydroxymethyl resin, and the resulting polypeptide will 
have a free carboxyl group at the C-terminus. 

35 Alternatively, when an amide resin such as benzhydrylamine 
or p-methylbenzhydrylamine resin (for tBoc chemistry) and 
Rink amide or PAL resin (for Fmoc chemistry) are used, an 
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amide bond is formed and the resulting polypeptide will 
have a carboxamide group at the C-terminus. These resins, 
whether polystyrene- or polyamide-based or 

polyethyleneglycol-graf ted, with or without a handle or 
5 linker, with or without the first amino acid attached, are 
commercially available, and their preparations have been 
described by Stewart et al . , "Solid Phase Peptide 
Synthesis" (2nd Edition), (Pierce Chemical Co., Rockford, 
IL, 1984) and Bayer and Rapp, Chem. Pept. Prot ■ 2:3, 1986; 
s «. 10 and Atherton et al . , Solid Phase Peptide Synthesis: A 
P Practical Approach , IRL Press, Oxford, 1989. 

ff The C-terminal amino acid, protected at the side 

M chain if necessary, and at the alpha-amino group,, is 

attached to a hydroxylmethyl resin using various 
, 15 activating agents including dicyclohexylcarbodiimide 
(DCC) , N,N' -diisopropylcarbodiimide (DIPCDI) and 

carbonyldiimidazole (CDI) . It can be attached to 
UI chloromethyl or chlorotrityl resin directly in its cesium 

J:! tetramethylammonium salt form or in the presence of 

20 triethylamine (TEA) or diisopropylethylamine (DIEA) . First 
amino acid attachment to an amide resin is the same as 
amide bond formation during coupling reactions. 

Following the attachment to the resin support, 
the alpha-amino protecting group is removed using various 
25 reagents depending on the protecting chemistry (e.g., 
tBoc, Fmoc) . The extent of Fmoc removal can be monitored 
at 300-320 nm or by a conductivity cell. After removal of 
the alpha-amino protecting group, the remaining protected 
amino acids are coupled stepwise in the required order to 
3 0 obtain the desired sequence. 

Various activating agents can be used for the 
coupling reactions including DCC, DIPCDI, 2-chloro-l , 3- 
dimethylimidium hexaf luorophosphate (CIP) , benzotriazol-1- 
yl-oxy-tris- (dimethylamino) -phosphonium hexafluoro- 
35 phosphate (BOP) and its pyrrolidine analog (PyBOP) , bromo- 
tris-pyrrolidino-phosphonium hexaf luorophosphate (PyBrOP) , 
0- (benzotriazol-l-yl) -1,1,3, 3-tetramethyl-uronium 
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hexaf luorophosphate (HBTU) and its tetraf luoroborate 
analog (TBTU) or its pyrrolidine analog (HBPyU) , 0-(7- 
azabenzotriazol-l-yl) -1, 1,3, 3-tetramethyl-uronium 
hexaf luorophosphate (HATU) and its tetraf luoroborate 
5 analog (TATU) or its pyrrolidine analog (HAPyU) . The most 
common catalytic additives used in coupling reactions 
include 4-dimethylaminopyridine (DMAP) , 3 -hydroxy-3 , 4 - 
dihydro-4-oxo-l, 2 , 3-benzotriazine (HODhbt) , N- 

hydroxybenzotriazole (HOBt) and l-hydroxy-7- 

h* 10 azabenzotriazole (HOAt) . Each protected amino acid is 
5{ used in excess (>2.0 equivalents), and the couplings are 

jj usually carried out in N-methylpyrrolidone (NMP) or in 

DMF, CH2C12 or mixtures thereof. The extent of completion 
gi of the coupling reaction can be monitored at each stage, 

15 e.g., by the ninhydrin reaction as described by Kaiser et 
p= al.. Anal. Biochem . 34:595, 1970. 

W After the entire assembly of the desired 

r\ 

T;, peptide, the peptide-resin is cleaved with a reagent with 

□ proper scavengers . The Fmoc peptides are usually cleaved 

20 and deprotected by TFA with scavengers (e.g., H20, 
ethanedi thiol , phenol and thioanisole) . The tBoc peptides 
are usually cleaved and deprotected with. liquid HF for 1-2 
hours at -5 to 0° C, which cleaves the polypeptide from 
the resin and removes most of the side-chain protecting 

25 groups. Scavengers such as anisole, dimethylsulf ide and p- 
thiocresol are usually used with the liquid HF to prevent 
cations formed during the cleavage from alkylating and 
acylating the amino acid residues present in the 
polypeptide. The formyl group of tryptophan and the 

30 dinitrophenyl group of histidine need to be removed, 
respectively by piperidine and thiophenyl in DMF prior to 
the HF cleavage . The acetamidomethyl group of cysteine 
can be removed by mercury (II) acetate and alternatively by 
iodine, thallium(III) trif luoroacetate or silver 

35 tetraf luoroborate which simultaneously oxidize cysteine to 
cystine. Other strong acids used for tBoc peptide cleavage 
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and deprotection include trif luoromethanesulf onic acid 
(TFMSA) and trimethylsilyltrif luoroacetate (TMSOTf ) . 

A zampl polypeptide ligand-binding polypeptide 
can also be used for purification of ligand. The 
5 polypeptide is immobilized on a solid support, such as 
beads of agarose, cross-linked agarose, glass, cellulosic 
resins, silica-based resins, polystyrene, cross-linked 
polyacrylamide, or like materials that are stable under 
the conditions of use. Methods for linking polypeptides 

h* 10 to solid supports are known in the. art, and include amine 

P 

rj chemistry, cyanogen bromide activation, N- 

il hydroxysuccinimide activation, epoxide activation, 

sulfhydryl activation, and hydrazide activation. The 

£n resulting medium will generally be configured in the form 

m 

15 of a column, and fluids containing ligand are passed 

E 

O through the column one or more times to allow ligand to 

Jjj bind to the receptor polypeptide. The ligand is then 

m eluted using changes in salt concentration, chaotropic 

O agents (guanidine HC1) , or pH to disrupt ligand-receptor 

ru 

20 binding. 

An assay system that uses a ligand-binding 
receptor (or an antibody, one member of a complement/ 
anti-complement pair) or a binding fragment thereof, and a 
commercially available biosensor instrument (BIAcore™, 

25 Pharmacia Biosensor, Piscataway, NJ) may be advantageously 
employed. Such receptor, antibody, member of a 

complement/anti-complement pair or fragment is immobilized 
onto the surface of a receptor chip. Use of this 
instrument is disclosed by Karlsson, J ■ Immunol . Methods 

30 145 : 229-40 , 1991 and Cunningham and Wells, J. . Mol . Biol . 
234 : 554-63 , 1993. A .receptor, antibody, member or 

fragment is covalently attached, using amine or sulfhydryl 
chemistry, to dextran fibers that are attached to gold 
film within the flow cell. A test sample is passed 

35 through the cell. If a ligand, epitope, or opposite 
member of the complement/anti- complement pair is present 
in the sample, it will bind to the immobilized receptor, 



antibody or member, respectively, causing a change in the 
refract.ive index of the medium, which is detected as a 
change in surface plasmon resonance of the gold film. 
This system allows the determination of on- and off -rates, 
from which binding affinity can be calculated, and 
assessment of stoichiometry of binding. 

Ligand-binding receptor polypeptides can also be 
used within other assay systems known in the art. Such 
systems include Scatchard analysis for determination of 
binding affinity (see Scatchard, Ann. NY Acad. Sci . 51: 
660-72, 1949) and calorimetric assays (Cunningham et al., 
Science 253 : 545-48 , 1991; Cunningham et al . , Science 
245:821-25, 1991) . 

Zampl polypeptides can also be used to prepare 
antibodies that specifically bind to zampl polypeptide 
epitopes, peptides or polypeptides. Methods for preparing 
polyclonal and monoclonal antibodies are well known in the 
art (see, for example, Sambrook et al . , Molecular Cloning : 
A Laboratory Manual . Second Edition . Cold Spring Harbor, 
NY, 1989; and Hurrell, J. G. R. , Ed., Monoclonal Hvbridoma 
Antibodies: Techniques and Applications , CRC Press, Inc., 
Boca Raton, FL, 1982) . As would be evident to one of 
ordinary skill in the art, polyclonal antibodies can be 
generated from a variety of warm-blooded animals, such as 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
and rats. 

The immunogenicity of a zampl polypeptide may be 
increased through the use of an adjuvant, such as alum 
(aluminum hydroxide) or Freund's complete or incomplete 
adjuvant. Polypeptides useful for immunization also 
include fusion polypeptides, such as fusions of zampl 
polypeptide or a portion thereof with an immunoglobulin 
polypeptide or with maltose binding protein. The 
polypeptide immunogen may be a full-length molecule or a 
portion thereof. If the polypeptide portion is "hapteri- 
like", such portion may be advantageously joined or linked 
to a macromolecular carrier (such as keyhole limpet 
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hemocyanin (KLH) , bovine serum albumin (BSA) or tetanus 
toxoid) for immunization. 

As used herein, the term "antibodies" includes 
polyclonal antibodies, affinity-purified polyclonal 
5 antibodies, monoclonal antibodies, and antigen-binding 
fragments, such as F{ab'>2 an d Fab proteolytic fragments. 
Genetically engineered intact antibodies or fragments, 
such as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen- 
□ 10 binding peptides and polypeptides, are also included-. 

Non-human antibodies may be humanized by grafting only 
y, non-human CDRs onto human framework and constant regions, 

; or by incorporating the entire non-human variable domains 

jji (optionally "cloaking" them with a human-like surface by 

15 replacement of exposed residues, wherein the result is a 
T\ "veneered" antibody). In some instances, humanized 

O antibodies may retain non-human residues within the human 

Jij variable region framework domains to enhance proper 

■ binding characteristics. Through humanizing antibodies, 

20 biological half-life may be increased, and the potential 
for adverse immune reactions upon administration to humans 
is reduced. Alternative techniques for generating or 
selecting antibodies useful herein include in vitro 
exposure of lymphocytes to zampl protein or peptide, and 
25 selection of antibody display libraries in phage or 
similar vectors (for instance, through use of immobilized 
or labeled zampl protein or peptide) . 

Antibodies are defined to be specifically 
binding if: 1) they exhibit a threshold level of binding 
30 activity, and/or 2) they do not significantly cross-react 
with related polypeptide molecules. First, antibodies 
herein specifically bind if they bind to a zampl 
polypeptide, peptide or epitope with a binding affinity 
(Ka) of 10 s mol" 1 or greater, preferably 10 7 mol" 1 or 
35 greater, more preferably 10 s mol" 1 or greater, and most 
preferably 10 s mol" 1 or greater. The binding affinity of an 
antibody can be readily determined by one of ordinary 
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skill in the art, for example, by Scatchard analysis (G. 
Scatchard, Ann. NY Acad. Sci. 51 : 660-72, 1949) . 

Second, antibodies specifically bind if they do 
not significantly cross-react with related polypeptides. 
5 Antibodies do not significantly cross-react with related 
polypeptide molecules, for example, if they detect human 
zampl polypeptide, but not known related polypeptides 
using a standard Western blot analysis (Ausubel et al . , 
ibid. ) . Examples of known related polypeptides are 

10 orthologs, that is, proteins from, the same species that 
are members of a protein family, such as other known human 
p-defensins (e.g. , hBD-1 and hBD-2) , mutant human p- 
defensins, and non-human p-defensins. 

Moreover, antibodies may be "screened against" 

15 known related polypeptides, to isolate a population that 
specifically binds to the inventive polypeptides. For 
example, antibodies raised to human zampl polypeptide are 
adsorbed with related polypeptides adhered to an insoluble 
matrix; antibodies specific to human zampl polypeptide 

2 0 will flow through the matrix under the proper buffer 
conditions. Such screening allows isolation of polyclonal 
and monoclonal antibodies non-crossreactive to closely 
related polypeptides (see, Antibodies: A Laboratory 
Manual, Harlow and Lane (eds.), Cold Spring Harbor 

25 Laboratory Press, 1988 ; ' Current Protocols in Immunology, 
Cooligan et al . (eds.), National Institutes of Health, 
John Wiley and Sons, Inc., 1995). Screening and isolation 
of specific antibodies is well known in the art (see, 
Fundamental Immunology, Paul (ed.), Raven Press, 1993; 

30 Getzoff et al . , Adv. Immunol. 43.: 1-98, 1988; Monoclonal 
Antibodies: Principles and Practice, Goding, J.W. (eds.). 
Academic Press Ltd., 1996; Benjamin et . al . , Ann ■ Rev ■ 
Immunol. 2:67-101, 1984). 

A variety of assays known to those skilled in 

35 the art can be utilized to detect antibodies which 
specifically bind to zampl proteins or peptides.- 
Exemplary assays are described in detail in Antibodies: A 
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Laboratory Manual , Harlow and Lane (Eds.), Cold Spring 
Harbor Laboratory Press, 1988. Representative examples of 
such assays include: concurrent Immunoelectrophoresis, 
radioimmunoassay, radioimmuno-precipitation, enzyme-linked 
S immunosorbent assay (ELISA) , dot blot or Western blot 
assay, inhibition or competition assay, and sandwich 
assay. In addition, antibodies can be screened for 
binding to wild-type versus mutant zampl protein or 
peptide . 

Lb 

p 10 Antibodies to zampl polypeptides may be used for 

□ tagging cells that express zampl polypeptides; for. 

* isolating zampl polypeptides by affinity purification; for 

Li diagnostic assays for determining circulating levels of 

zampl polypeptides; for detecting or quantitating soluble 
7 15 zampl polypeptide as a marker of underlying pathology or 

disease; in analytical methods employing FACS ; for 
screening expression libraries; for generating anti- 
Ill idiotypic antibodies; and as neutralizing antibodies or as 
S; antagonists to block anti-microbial activity in vitro and 

2 0 in vivo. 

For pharmaceutical use, the proteins of the 
present invention are formulated for topical, inhalant or 
parenteral, particularly intravenous or subcutaneous, 
delivery according to conventional methods. Intravenous 
25 administration will be by bolus injection or infusion over 
a typical period of one to several hours. In general, 
pharmaceutical formulations will include a zampl protein 
in combination with a pharmaceutically acceptable vehicle, 
such as saline, buffered saline, 5% dextrose in water or 

3 0 the like. Formulations may further include one or more 

excipients, preservatives, solubilizers , buffering agents, 
albumin to prevent protein loss on vial surfaces, etc. 
Methods of formulation are well known in the art and are 
disclosed, for example, in Remington: The Science and 
35 Practice of Pharmacy . Gennaro, ed. , Mack Publishing Co., 
Easton, PA, 19th ed., 1995. Therapeutic doses will 
generally determined by the clinician according to 
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accepted standards, taking into account the nature and 
severity of the condition to be treated, patient traits, 
etc. Determination of dose is within the level of 
ordinary skill in the art. The proteins may be 

administered for acute treatment, over one week or less, 
often over a period of one to three days or may be used in 
chronic treatment, over several months or years. 

The invention is further illustrated by the 
following non-limiting examples. 

Example 1 
Extension of EST Sequence 



01 The novel zampl polypeptides of the present 

y: 15 invention were initially identified by querying an EST 
Q database for homologous sequences to the SAP-1 human 

to defensin isolated from human psoriatic skin. A single EST 

ijj sequence was discovered in a bronchial epithelium cDNA 

O library and was predicted to be related to the P-defensin 

20 family. A' second search based upon the p-defensin 
consensus motif also identified the EST. 

To identify the corresponding cDNA, a clone 
containing the EST was sought, but was not located. 
Oligonucleotides ZC14741 . (SEQ ID NO: 5), ZC14740 (SEQ ID 
25 NO: 6) were used in a PCR reaction to isolate the zampl 
polypeptide-encoding sequence from' human genomic DNA. 
Reaction conditions were 94°C for 1 minute and 30 seconds, 
followed by 35 cycles of 94°C for 10 seconds, 58°C for 20 
seconds and 72°C for 20 seconds, followed by 72°C for ten 
3 0 minutes. As a template, 100 ng of human genomic DNA was 
used, and Clontech Advantage PCR mix (Clontech, Palo Alto, 
California) was used as the polymerase mixture. The 
resulting 113 bp fragment was then purified on a 3.2% 
NuSieve (FMC Bioproducts, Rockland, ME) gel using a 
35 QiaexII Gel Extraction Kit (Qiagen, Inc., Chatsworth, 
California) according to the manufacturer's directions. 
The purified material was used as a template for 
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sequencing. The template was sequenced on an ABIPRISM ™ 
model 377 DNA sequencer (Perkin-Elmer Cetus, Norwalk, Ct.) 
using the ABI PRISM™ Dye Terminator Cycle Sequencing Ready 
Reaction Kit (Perkin-Elmer Corp.) according to 
5 manufacturer's instructions. Oligonucleotides ZC14741 
(SEQ ID NO: 5), ZC14740 (SEQ ID NO: 6) were used as 
primers for sequencing the clone. Sequencing reactions 
were carried out in a Hybaid OmniGene Temperature Cycling 
System (National Labnet Co., Woodbridge, NY). SEQUENCHER™ 

10 3 . 1 sequence analysis software (Gene Codes Corporation, 
Ann Arbor, MI) was used for data analysis. The resulting 
113 bp sequence is disclosed in SEQ ID NO : 1. Comparison 
of the originally derived EST sequence with the sequence 
represented in SEQ ID NO: 1 showed that there were 2 base 

15 pair differences which resulted in 1 amino acid difference 
between the deduced amino acid sequences. Note that one 
of the base pair differences were from unknown "N" 
residues in the EST sequence to known residues in SEQ ID 
NO : 1 . 

20 Generally, one or a combination of several 

techniques could be used to obtain the full length 
sequence of the zampl polypeptide-encoding polynucleotide. 
First, if one or . more additional ESTs are identified that 
contig to the clone sequenced above, clones corresponding 

25 to such ESTs can be ordered and sequenced as described 
above and spliced together with the original sequence to 
form the full length sequence. If a small portion of the 
full length sequence is absent, 5' RACE reactions can be 
done, and the resulting fragments can be sequenced as 

30 described above and spliced together with the original 
sequence to form the full length sequence. Also, one or 
more cDNA libraries can be probed with all or a portion of 
SEQ ID NO: 1 to identify a putative full-length clone. 
Such a full length clone can be sequenced as described 

35 above. 



Example 2 
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Tissue Distribution 

Northerns were performed using Human Multiple 
Tissue Blots from Clontech (Palo Alto, CA) . An 
5 approximately 113 bp DNA probe, based directly on the 
identified EST, was generated using PCR techniques, 
specifically a 35 cycle reaction with an annealing 
temperature of 58°C using Clontech Advantage KlenTaq 
Polymerase mix (Clontech) . The DNA probe was 

£ 10 radioactively labeled with P using REDI PRIME DNA 
j"5 labeling system (Amersham, Arlington Heights, IL) 

il according to the manufacturer's specifications. The probe 

was purified using a NUCTRAP push column (Stratagene 
jl! Cloning Systems, La Jolla, CA) . EXPRESSHYB (Clontech) 

15 solution was used for prehybridization and as a 
rs hybridizing solution for the Northern blots. 

U' Hybridization took place overnight at 55°C, and the blots 

m were then washed in 2X SSC and 0.1% SDS at RT, followed by 

G a wash in 0 . IX SSC and 0.1% SDS at 50°C. No expression was 

20 observed. It thus appears that normal tissue levels of 
mRNA of zampl polypeptide are below the detection 
sensitivity of the Northern blot. Such an observation is 
consistent with the knowledge in the art regarding some 
defensins, i.e. , that they are cons ti tut ively expressed at 
25 low levels but are highly inducible upon infection. 

Example 3 

Chromosomal Mapping of the Zampl Gene 

3 0 The zampl gene was mapped to chromosome 8 using 

the commercially available "GeneBridge 4 Radiation Hybrid 
Panel" (Research Genetics, Inc. , Huntsville, AL) . The 
GeneBridge 4 Radiation Hybrid Panel contains PCRable DNAs 
from each of 93 radiation hybrid clones, plus two control 

35 DNAs (the HFL donor and the A23 recipient) . A publicly 
available WWW server (http://www-genome.wi.mit.edu/cgi- 
bin/contig/rhmapper.pl) allows mapping relative to the 
Whitehead Institute/MIT Center for Genome Research's 
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radiation hybrid map of the human genome (the "WICGR" 
radiation hybrid map) which was constructed with the 
GeneBridge 4 Radiation Hybrid Panel. 

■ For the mapping of the zampl gene with the 
5 "GeneBridge 4 RH Panel", 20 ul reactions were set up in a 
PCRable 96-well microtiter plate (Stratagene, La Jolla, 
CA) and used in a "RoboCycler Gradient 96" thermal cycler 
(Stratagene) . Each of the 95 PCR reactions consisted of 2 
ul 10X KlenTaq PCR reaction buffer (CLONTECH Laboratories, 

10 Inc., Palo Alto, CA) , 1.6 ul dNTPs mix (2.5 mM each, 
PERKIN- ELMER, Foster City, CA) , 1 ul sense primer, ZC 
14,780 (SEQ ID NO: 7), 1 ul antisense primer, ZC 14,776 
(SEQ ID NO: 8), 2 ul "RediLoad" (Research Genetics, Inc., 
Huntsville, AL) , 0.4 ul 50X Advantage KlenTaq Polymerase 

15 Mix (Clontech Laboratories, Inc.), 25 ng of DNA from an 
individual hybrid clone or control and x ul ddH20 for a 
total volume of 20 ul . The reactions were overlaid with an 
equal amount of mineral oil and sealed. The PCR cycler 
conditions were as follows: an initial 1 cycle 5 minute 

20 denaturation at 95°C, 35 cycles of a 1 minute denaturation 
at 95°C, 1 minute annealing at 52°C and 1.5 minute 
extension at 72°C, followed by a final 1 cycle extension of 
7 minutes at 72°C. The reactions were separated by 
electrophoresis on a 2% agarose gel (Life Technologies, 

25 Gaithersburg, MD) . 

The results showed that the zampl gene maps 33.5 
cR_3 000 from the top of the human chromosome 8 linkage 
group on the WICGR radiation hybrid map. Proximal and 
distal framework markers were CHLC . GATA62D10 and WI-3823. 

30 (D8S1511) , respectively. The use of surrounding markers 
positions the zampl gene in the 8p23.3-p23.2 region on the 
integrated LDB chromosome 8 map (The Genetic Location 
Database, University of Southhampton, WWW server: 
http : //cedar . genetics . soton.ac.uk/ public_html/) . 



35 
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Example 4 

Identification of DNA Encoding Full Length Zampl 
Polypeptide and Sequencing Thereof 

5 

The 5 1 end of zampl coding sequence was obtained 
by PCR using GenomeWalker® reagents (Clontech) in 
combination with a zampl polypeptide -spec if ic antisense 
primer ZC15591 (SEQ ID NO: 12) and then conducting nested 

10 PCR with ZC15589 (SEQ ID NO: 13), according to 
manufacturer's instructions, with the exception that 64 C C 
was used in the primary reaction instead of the suggested 
67°C. PCR products were run on a 2% agarose gel (Gibco) , 
and gel purified using Qiaex II reagents (Qiagen) 

15 according to manufacturer's instructions. Products were 
sequenced using ZC15589 (SEQ ID NO: 13) as a sequencing 
primer, revealing the extension of zampl polypeptide- 
. encoding to a putative initiation methionine (nucleotides 
1-201 or 1-219 of SEQ ID NO: 9) and about 250 base pairs of 

20 5' untranslated sequence. 



25 Example 5 

Synthesis of Zampl 

\ A 45 'amino acid residue zampl peptide (residues 

23 to 67 of SEQ ID NO: 2) was synthesized by solid phase 
30 peptide synthesis using a model 431A Peptide Synthesizer 
(Applied Biosystems/Perkin Elmer, Foster City, CA) . Fmoc- 
Lysine(Boc) resin (oNs2 mmol/g; Anaspec Inc., San Jose, 
CA) was used as the initial support resin. 1 mmol Amino 
acid cartridges (Anaspec Inc., San Jose, CA and Applied 
35 Biosystems/Perkin ElmerA Foster City, CA) were used for 
synthesis. 2- ( 1-H-benzotriazol-l-yl) -1 , 1 , 3 , 3 - 

tetramethyuroniumhexaf luoitophosphate (HBTU) , 1 -Hydroxy - 
benzotriazole (HOBt) , 2 M N , N-Diisopropylethylamine , N- 
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Methylpyrrolidonev, Dichloromethane (all from Applied 
Biosystems/Perkin \Elmer, Foster City, CA) , along with 
piperidine (Aldrich Vhemical Co., St. Louis, MO) and 0.5 
M acetic anhydride capping solution (Advanced ChemTech, 
5 Louisville, KY) , were ufeed as synthesis reagents. 

The Peptide Companion software (Peptides 
International, Louisville, KY) was used to help predict 
the aggregation potential for the synthesis for zampl . 
H Synthesis was performed using both single and double 

p 10 coupling cycles. Also, acetylation was used where 

'•' difficult couplings were predicted. 

L. The peptide was cleaved from the solid phase by 

B« the standard TFA cleavage procedure (according to Peptide 

Cleavage manual, Applied Biosystems/Perkin Elmer) . 
15 Purification of the peptide was done by RP-HPLC using a 
: j' C18, 52mm x 250mm, preparative column (Vydac, Hesperia, 

Lq CA) . Fractions from the column were collected and 

- analyzed for the correct mass by electrospray mass 

spectrometry; the purity was analyzed by analytical RP- 
20 HPLC, using a C18, 4.6mm x 250mm column (Vydac, Hesperia, 
CA) . The mass spectrometry analysis confirmed the desired 
molecular weight of the reduced form of zampl, i.e., 
5158. Purified fractions were frozen and then 

lyophilized. 

25 The reduced peptide was dissolved in 6 M 

guanidine HC1 (Aldrich Chemical Co.) at an initial 
concentration of 2 mg/ml . This solution was then added 
slowly to 2.1 volume equivalents of 1 M guanidine HC1 
along with 0.52 volume equivalents of DMSO (Aldrich 

30 Chemical Co.). The oxidation was monitored with 

analytical RP-HPLC using the same analytical C18 column; 
the oxidation was -complete at 48 hours. Salts were 

removed from the reaction mixture using solid phase 
extraction C18 cartridges (Waters, Milford, MA) . The 

35 eluant containing the oxidized peptide is concentrated and 
then purified using RP-HPLC semi-prep C18 column (Vydac, 
Hesperia, CA) . Four distinct peaks were determined to 
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correspond to the fully oxidized form of zampl by 
electrospray LCMS. The peak referred to as peak 2, as it 
was the second peak to elude by RP-HPLC, was found to 
contain the conserved defensin disulfide pattern, by a 
process of elimination using partial digest and peptide 
mapping of all four peaks. This peak was isolated, frozen 
and lyophilized. 

From the foregoing, it will be appreciated that, 
although specific embodiments of the invention have been 
described herein for purposes of illustration, various 
modifications may be made without deviating from the 
spirit and scope of the invention. Accordingly, the 
invention is not limited except as by the appended claims. 



